WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 ; 

C12Q 1/68, C12P 19/34, C07H 21702, 
21/04 



Al 



(11) International Publication Number: WO 98/53103 

(43) International Publication Date: 26 November 1998 (26.) 1.98) 



(21) International Application Number: PCT/US98/1056I 

(22) International Filing Date; 21 May 1998 (21.05,98) 



(30) Priority Data: 

08/859,998 
09/053,375 



21 May 1997 (21.05.97) US 
31 March 1998 (3L03.98) US 



(71) Applicant (for all designated States except US): CLONTECH 

LABORATORIES, INC. [US/US]; 1020 East Meadow Cir- 
cle, Palo Alto, CA 94303 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): CHENCHIK, Alex 
[RU/USJ; 670 San Antonio Road #30, Palo Alto, CA 94306 
(US). JOKHADZE, George [GE/US]; 360 Chiqutta Avenue 
#9, Mountain View, CA 94041 (US). BIBILASHVILLI, 
Robert [RU/RUI; 43 Kutuzovsky Prospect #85, Moscow, 
121170 (RU). 

(74) Agent; FIELD, Bret, E.; Bozicevic & Reed LLP, 200, 
285 Hamilton Avenue, Palo Alto, CA 94301 (US; 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR 
BY, CA, CH, CN, CU t CZ, DE, DK, EE, ES, FT, GB, GE, 
GH, GM, GW, HU, ID, IL, IS, JP ( KE, KG, KP, KR, KZ, 
LC, LK, LR, LS, LT t LU, LV, MD, MG, MK, MN, MW, 
MX, NO, NZ, PL, PT, RO, RU, SD, SE t SG, SI, SK, SL, 
TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO 
patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, 
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 



(54) Title: NUCLEIC ACID ARRAYS 
(57) Abstract 

Arrays of polynucleotide spots and kits comprising the same, as well as methods for their preparation and use are provided. The 
subject arrays include a plurality of polynucleotide spots stably associated with the surface of a solid support At least a portion of the 
polynucleotide spots comprises a polynucleotide probe composition that is made up of unique polynucleotides, where ail of the unique 
polynucleotides of the array correspond to a common type of gene. Also provided are sets of a representational number of gene specific 
primers suitable for use in generating target nucleic acid for use with the subject arrays. The subject arrays find use in hybridization assays, 
particularly in assays for the identification of differential gene expression patterns among two or more different types of cells. 



Document AJ 

Cited in IDS for CLON-017US1 
Serial No. .&\ 1*15 2, 2^3 
filed December 28, 2000 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL Albania 

AM Armenia 

AT Austria 

AO Australia 

AZ Azerbaijan 

BA Bosnia and Herzegovina 

BB Barbados 

BE Belgium 

BF Burkina Faso 

BG Bulgaria 

Bi Benin 

BR Brazil 

BY Belarus 

CA Canada 

CF Central African Republic 

CG Congo 

CH Switzerland 

CI C6te d'lvoire 

CM Cameroon 

CN China 

CU Cuba 

CZ Czech Republic 

DE Germany 

OK Denmark 

EE Estonia 



ES 


Spain 


LS 


Lesotho 


SI 


Ft 


Finland 


LT 


Lithuania 


SK 


FR 


France 


LU 


Luxembourg 


SN 


GA 


Gabon 


LV 


Latvia 


sz 


GB 


United Kingdom 


MC 


Monaco 


TD 


GE 


Georgia 


MO 


Republic of Moldova 


TG 


GH 


Ghana 


MG 


Madagascar 


TJ 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


GK 


Greece 




Republic of Macedonia 


TR 


HU 


Hungary 


ML 


Malt 


TT 


IE 


Ireland 


MN 


Mongolia 


UA 


IL 


ferae) 


MR 


Mauritania 


UG 


IS 


Iceland 


MW 


Malawi 


US 


IT 


Italy 


MX 


Mexico 


uz 


JP 


Japan 


NE 


Niger 


VN 


KE 


Kenya 


NL 


Netherlands 


YU 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


KP 


Democratic People's 


NZ 


New Zealand 






Republic of Korea 


PL 


Poland 




KR 


Republic of Korea 


PT 


Portugal 




KZ 


Kazakstan 


RO 


Romania 




LC 


Saint Lucia 


RU 


Russian Federation 




LI 


Liechtenstein 


SD 


Sudan 




LK 


Sri Lanka 


SB 


Sweden 




LR 


Liberia 


SC 


Singapore 





Slovenia 

Slovakia 

Senegal 

Swaziland 

Chad 

Togo 

Tajikistan 

Turkmenistan 

Turkey 

Trinidad and Tobago 

Ukraine 

Uganda 

United States of America 

Uzbekistan 

Viet Nam 

Yugoslavia 

Zimbabwe 



WO 98/53103 



PCT/US98/10561 



NUCLEIC ACID ARRAYS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of application serial no. 08/859,998 filed on 
May 21 , 1997 and application serial no. 09/053,375 filed on March 31, 1998, the disclosures 
of which are herein incorporated by reference. 

INTRODUCTION 

Technical Field 

The field of this invention is biopoiymeric arrays. 

Backgrou nd of the Invention 

"Biochips" or arrays of binding agents, such as oligonucleotides and peptides, have 
become an increasingly important tool in the biotechnology industry and related fields. 
These binding agent arrays, in which a plurality of binding agents are deposited onto a solid 
support surface in the form of an array or pattern, find use in a variety of applications, 
including drug screening, nucleic acid sequencing, mutation analysis, and the like. One 
important use of biochips is in the analysis of differential gene expression, where the 
expression of genes in different cells, normally a cell of interest and a control, is compared 
and any discrepancies in expression are identified. In such assays, the presence of 
discrepancies indicates a difference in the classes of genes expressed in the cells being 
compared. 

In methods of differential gene expression, arrays find use by serving as a substrate to 
which is bound polynucleotide "probe" fragments. One then obtains ''targets" from 
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analogous cells, tissues or organs of a healthy and diseased organism. The targets are then 
hybridized to the immobilized set of polynucleotide "probe" fragments. Differences between 
the resultant hybridization patterns are then detected and related to differences in gene 
expression in the two sources. 

A variety of different array technologies have been developed in order to meet the 
growing need of the biotechnology industry, as evidenced by the extensive number of 
patents and references listed in the relevant literature section below. 

Despite the wide variety of array technologies currently in preparation or available on 
the market, there is a continued need to identify new array devices to meet the needs of 
specific applications. Of particular interest would be the development of an array capable of 
providing high throughput analysis of differential gene expression. 

Relevant Literature 

Patents and patent applications describing arrays of biopolymeric compounds and 
methods for their fabrication include: 5,242,974; 5,384,261; 5,405,783; 5,412,087; 
5,424,186; 5,429,807; 5,436,327; 5,445,934; 5,472,672; 5,527,681; 5,529,756; 5,545,531; 
5,554,501; 5,556,752; 5,561,071; 5,599,895; 5,624,71 1; 5,639,603; 5,658,734; WO 
93/17126; WO 95/1 1995; WO 95/35505; EP 742 287; and EP 799 897. 

Patents and patent application describing methods of using arrays in various 
applications include: 5,143,854; 5,288,644; 5,324,633; 5,432,049; 5,470,710; 5,492,806; 
5,503,980; 5,510,270; 5,525,464; 5,547,839; 5,580,732; 5,661,028; WO 95/21265; WO 
96/31622; WO 97/10365; WO 97/27317; EP 373 203; and EP 785 280. 

Other references of interest include: Atlas Human cDNA Expression Array I (April 
1997) CLONTECHniques XII: 4-7; Lockhart et aL, Nature Biotechnology (1996) 14: 1675- 
1680; Shena et aL, Science (1995) 270: 467-470; Schenaet aL Proc. Nat'l Acad. Sci. USA 
(1996)93:10614-10619; Shaion et aL Genome Res. (1996) 6: 639-645; Milosavljevic et al., 
Genome Res. (1996) 6:132-141; Nguyen et aL, Genomics (1995)29: 207-216; Pietu et aL, 
Genome Res. (1996) 6: 492-503; Zhao et aL, Gene (1995) 166:207-213; Chalifour et aL, 
Anal. Biochem. (1994) 216:299-304; Heller et aL, Proc. Nat'l Acad. Sci. USA (1997) 94: 
2150-2155; and Schena, M„ Bio Assays (1996) 18: 427-431. 
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SUMMARY OF THE INVENTION 
Arrays of polynucleotide spots stably associated with the surface of a solid support 
and kits comprising the same, as well as methods for their preparation and use in 
hybridization assays, are provided. The subject arrays comprise a plurality of polynucleotide 
spots, wherein each different polynucleotide spot is made up of a polynucleotide probe 
composition and at least a portion of the polynucleotide probe compositions are made up of 
unique polynucleotides. The arrays are further characterized in that all of the unique 
polynucleotides on the array correspond to the same type of gene. The subject arrays find 
particular use in differential gene expression analysis. Also provided are sets of a 
representational number of gene specific primers useful in generating target nucleic acids for 
use with the subject arrays in hybridization assays. 

BRIEF DESCRIPTION OF THE HOT JJ3FS 
Fig. 1 provides a representation of an array according to the subject invention. 

DEFINITIONS 

The term "nucleic acid" as used herein means a polymer composed of nucleotides, 
e.g. deoxyribonucleotides or ribonucleotides. 

The terms "ribonucleic acid" and "RNA" as used herein mean a polymer composed 
of ribonucleotides. 

The terms "deoxyribonucleic acid" and "DNA" as used herein mean a polymer 
composed of deoxyribonucleotides. 

The term "oligonucleotide" as used herein denotes single stranded nucleotide 
multimers of from about 10 to 100 nucleotides in length. 

The term "polynucleotide" as used herein refers to single or double stranded polymer 
composed of nucleotide monomers of greater than about 120 nucleotides in length up to 
about 1 000 nucleotides in length. 

The term "array type" refers to the type of gene represented on the array by the 
unique polynucleotides, where the type of gene that is represented on the array is dependent 
on the intended purpose of the array, e.g. to monitor expression of key human genes, to 
monitor expression of known oncogenes, etc, i.e. the use for which the array is designed. As 
such, all of the unique polynucleotides on a given array correspond to the same type or 
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category or group of genes. Genes are considered to be of the same type if they share some 
common linking characteristics, such as: species of origin, e.g. human, mouse, rat, etc.; 
tissue or cell type of origin, e.g. muscle, neural, dermal, organ, etc.; disease state, e.g. cancer; 
functions, e.g. protein kinases, tumor supressors and the like, participation in the same 
normal biological process, e.g. apoptosis, signal transduction, cell cycle regulation, 
proliferation, differentiation etc.; and the like. For example, one array type that is provided 
below is a "cancer array" in which each of the "unique" polynucleotide probes correspond to 
a gene associated with a cancer disease state. Likewise, a "human array" may be an array of 
polynucleotides corresponding to unique tightly regulated human genes. Similarly, an 
"apoptosis array" may be an array type in which the polynucleotides correspond to unique 
genes associated with apoptosis. 

The "unique" polynucleotide sequences associated with each type of array of the 
present invention are sequences which are distinctive or different with respect to every other 
polynucleotide sequence on the array and correspond to the same type of gene, as defined 
above. For example, in a cancer array, each unique polynucleotide has a sequence that is not 
homologous to any other known cancer associated sequence. Moreover, each polynucleotide 
sequence on the array is statistically chosen to ensure that the probability of homology to any 
sequence of that type is very low. Morever, in the cancer array embodiment, all sequences 
are statistically chosen to insure that the probability of homology to any other sequence 
associated with cancer or of human origin is very low. An important feature of the individual 
polynucleotide probe compositions of the subject arrays is that they are only a fragment of 
the entire cDNA of the gene to which they correspond. In other words, for each gene 
represented on the array, the entire cDNA sequence the gene is not represented on the array. 
Instead, the sequence of only a portion or fragment of the entire cDNA is represented on the 
25 array by this unique polynucleotide. 

The term "polynucleotide probe composition" refers to the nucleic acid composition 
that makes up each of the spots on the array. Thus, the term " polynucleotide probe 
composition" includes nucleic acid compositions of unique polynucleotides and control or 
calibrating polynucleotides (e.g. polynucleotides corresponding to housekeeping genes). The 
polynucleotide compositions are made up of single stranded polynucleotides (i.e. 
polynucleotides that are not hybridized to each other), where all of the polynucleotides in the 
probe composition may be identical to each other or there may be two different 
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polynucleotides (polynucleotides of different nucleotide sequence) in each probe 
composition, where the two different polynucleotides are complementary to each other. 

The term "gene specific primer" means a polynucleotide of sufficient length to 
specifically hybridize to a distinct nucleic acid member of the sample, e.g. RNA or cDNA, 
where the length of the gene specific primers will usually be at least 8 nt, more usually at 
least 20 nt and may be as long as 25 nt or longer, but will usually not exceed 50 nt. The gene 
specific primers of the subject invention are sufficiently specific to hybridize to 
complementary template sequence during the generation of labeled nucleic acids under 
conditions sufficient for first strand cDNA synthesis, which conditions are known by those 
of skill in the art. The number of mismatches between the gene specific primer sequences 
and their complementary template sequences to which they hybridize during the generation 
of labeled nucleic acids in the subject methods will generally not exceed 20 %, usually will 
not exceed 10 % and more usually will not exceed 5 %, as determined using the FASTA 
program using default settings. 

DESCRIPTION OF THR SPECIFIC RMBODIMENTS 
Arrays of polynucleotide spots and methods for their preparation are provided. In the 
subject arrays, a plurality of polynucleotide spots is stably associated with the surface of a 
solid support, where at least a portion of the polynucleotide spots on the array are made up 
of unique polynucleotides and all of the unique polynucleotides of the array correspond to 
one particular type of gene, e.g. tightly regulated human genes, genes associated with a 
particular disease state, genes associated with cell cycle regulation, etc. The subject arrays 
find particular use in gene expression assays. Also provided are sets of a representational 
number of gene specific primers useful in generating target nucleic acids for use with the 
subject arrays. In further describing the subject invention, the arrays first will be described in 
general terms. Next, methods for their preparation are described. Following this, a 
description of representative specific array types falling within the scope of the invention 
will be provided. Finally, a review of representative applications in which the subject arrays 
may be employed will be provided, where this review includes a description of the sets of a 
representational number of gene specific primers according to the subject invention. 
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Before the subject invention is further described, it is to be understood that the 
invention is not limited to the particular embodiments of the invention described below, as 
variations of the particular embodiments may be made and still fall within the scope of the 
appended claims. It is also to be understood that the terminology employed is for the purpose 
5 of describing particular embodiments, and is not intended to be limiting. Instead, the scope 
of the present invention will be established by the appended claims. 

In this specification and the appended claims, the singular forms "a," "an," and "the" 
include plural reference unless the context clearly dictates otherwise. Unless defined 
10 otherwise, all technical and scientific terms used herein have the same meaning as 

commonly understood to one of ordinary skill in the art to which this invention belongs. 

Arrays of the Subject Invention-General Description 

1 5 Array Structure 

The arrays of the subject invention have a plurality of polynucleotide spots stably 
associated with a surface of a solid support. Each spot on the array comprises a 
polynucleotide sample, Le. polynucleotide probe composition, of known identity, usually of 
known sequence, as described in greater detail below; The polynucleotide spots on the array 

20 may be any convenient shape, but will typically be circular, eliiptoid, oval or some other 
analogously curved shape. The density of the spots on the solid surface is at least about 
5/cnr and usually at least about 10/cnr but does not exceed about 1000/cnrr, and usually 
does not exceed about 500/cnr, and more usually does not exceed about 300/cnr. The spots 
may be arranged in any convenient pattern across or over the surface of the array, such as in 

25 rows and columns so as to form a grid, in a circular pattern, and the like, where generally the 
pattern of spots will be present in the form of a grid across the surface of the solid support. 
See Fig. 1. 

In the subject arrays, the spots of the pattern are stably associated with the surface of 
a solid support, where the support may be a flexible or rigid solid support. By stably 
30 associated is meant that the polynucleotides of the spots maintain their position relative to 
the solid support under hybridization and washing conditions. As such, the polynucleotide 
members which make up the spots can be non-covalently or covalently stably associated 
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with the support surface. Examples of non-covalent association include non-specific 
adsorption, binding based on electrostatic (e.g. ion, ion pair interactions), hydrophobic 
interactions, hydrogen bonding interactions, specific binding through a specific binding pair 
member covalently attached to the support surface, and the like. Examples of covalent 
5 binding include covalent bonds formed between the spot polynucleotides and a functional 
group present on the surface of the rigid support, e.g. -OH, where the functional group may 
be naturally occurring or present as a member of an introduced linking group, as described in 
greater detail below. 

As mentioned above, the array is present on either a flexible or rigid substrate. By 
10 flexible is meant that the support is capable of beL ^ bent, folded or similarly manipulated 

without breakage. Examples of solid materials which are flexible solid supports with respect 

to the present invention include membranes, e.g. nylon, flexible plastic films, and the like. 

By rigid is meant that the support is solid and does not readily bend i.e. the support is not 

flexible. As such, the rigid substrates of the subject arrays are suffu tert to provide physical 
1 5 support and structure to the polymeric targets present thereon under the assay conditions in 

which the array is employed, particularly under high throughput handling conditions. 

Furthermore, when the rigid supports of the subject invention are bent, they are prone to 

breakage. 

The solid supports upon which the subject patterns of spots are presented in the 
20 subject arrays may take a variety of configurations ranging from simple to complex, 

depending on the intended use of the array. Thus, the substrate could have an overall slide or 
plate configuration, such as a rectangular or disc configuration. In many embodiments, the 
substrate will have a rectangular cross-sectional shape, having a length of from about 10 mm 
to 200 mm, usually from about 40 to 150 mm and more usually from about 75 to 125 mm 
25 and a width of from about 1 0 mm to 200 mm, usually from about 20 mm to 1 20 mm and 
more usually from about 25 to 80 mm, and a thickness of from about 0.01 mm to 5.0 mm, 
usually from about 0.1 mm to 2 mm and more usually from about 0.2 to 1 mm. 

The substrates of the subject arrays may be fabricated from a variety of materials. 
The materials from which the substrate is fabricated should ideally exhibit a low level of 
30 non-specific binding during hybridization events. In many situations, it will also be 

preferable to employ a material that is transparent to visible and/or U V light. For flexible 
substrates, materials of interest include: nylon, both modified and unmodified, nitrocellulose. 
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polypropylene, and the like, where a nylon membrane, as well as derivatives thereof, is of 
particular interest in this embodiment. For rigid substrates, specific materials of interest 
include: glass; plastics, e.g. polytetrafluoroethylene, polypropylene, polystyrene, 
polycarbonate, and blends thereof, and the like; metals, e.g. gold, platinum, and the like; etc. 
5 The substrates of the subject arrays comprise at least one surface on which the pattern 

of spots is present, where the surface may be smooth or substantially planar, or have 
irregularities, such as depressions or elevations. The surface on which the pattern of spots is 
present may be modified with one or more different layers of compounds that serve to 
modify the properties of the surface in a desirable manner. Such modification layers, when 
10 present, will generally range in thickness from a monomolecular thickness to about 1 mm, 
usually from a monomolecular thickness to about 0. 1 mm and more usually from a 
monomolecular thickness to about 0,001 mm. Modification layers of interest include: 
inorganic and organic layers such as metals, metal oxides, polymers, small organic 
molecules and the like. Polymeric layers of interest include layers of: peptides, proteins, 
1 5 polynucleic acids or mimetics thereof, e.g. peptide nucleic acids and the like; 

polysaccharides, phospholipids, polyurethanes, polyesters, polycarbonates, polyureas, 
polyamides, polyethyieneamines, polyarylene sulfides, polysiloxanes, polyimides, 
polyacetates, and the like, where the polymers may be hetero- or homopolymeric, and may 
or may not have separate functional moieties attached thereto, e.g. conjugated. 
20 The total number of spots on the substrate will vary depending on the number of 

different polynucleotide probes one wishes to display on the surface, as well as the number 
of control spots, calibrating spots and the like, as may be desired depending on the particular 
application in which the subject arrays are to be employed. Generally, the pattern present on 
the surface of the array will comprise at least about 10 distinct spots, usually at least about 
25 20 distinct spots, and more usually at least about 50 distinct spots, where the number of spots 
may be as high as 10,000 or higher, but will usually not exceed about 5,000 distinct spots, 
and more usually will not exceed about 3,000 distinct spots. In many embodiments, it is 
preferable to have each distinct probe composition presented in duplicate, i.e. so that there 
are two spots for each distinct polynucleotide probe composition of the array. In certain 
30 embodiments, the number of spots will range from about 200 to 600. 

The amount of polynucleotide present in each spot will be sufficient to provide for 
adequate hybridization and detection of target nucleic acid during the assay in which the 

-8- 
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array is employed. Generally, the amount of polynucleotide in each spot will be at least 
about 0. 1 ng, usually at least about 0.5 ng and more usually at least about I ng, where the 
amount may be as high as 1000 ng or higher, but will usually not exceed about 20 ng and 
more usually wilt not exceed about 10 ng. The copy number of each polynucleotide in a spot 
5 will be sufficient to provide enough hybridization sites for target molecule to yield a 
detectable signal, and will generally range from about 0.01 fmol to 50 fmol, usually from 
about 0,05 fmol to 20 fmol and more usually from about 0.1 fmol to 5 fmol. Where the spot 
has an overall circular dimension, the diameter of the spot will generally range from about 
10 to 5,000 /^m, usually from about 20 to 2,000 ^m and more usually from about 50 to 1000 
10 /^m. 

A critical feature of the subject arrays is that at least a portion, usually the majority, 
of the polynucleotide spots on the array are made up of polynucleotide probes that all 
correspond to the same kind or kind of gene, i.e* genes that all share some common 
characteristic or can be grouped together based on some common feature, such as species of 

1 5 origin, tissue or cell of origin, functional role, disease association, etc. Other spots which 
may be present in the pattern include spots comprising genomic DNA, housekeeping genes, 
negative and positive control genes, and the like. These latter types of spots comprise 
polynucleotides that are not "unique" as that term is defined and used herein, i.e. they are 
"common." In other words, they are calibrating or control genes whose function is not to tell 

20 whether a particular "key" gene of interest is expressed, but rather to provide other useful 
information, such as background or basal level of expression, and the like. The percentage 
of spots which are made of unique polynucleotides that correspond to the same type of gene 
is generally at least about 30 number %, and usually at least about 60 number % and more 
usually at least about 80 number %. Therefore, the arrays of the subject invention will be of a 

25 specific array type, where representative array types include: human arrays, mouse arrays, 
cancer arrays, apoptosis arrays, human stress arrays, oncogene and tumor suppressor arrays, 
cell-cell interaction arrays, cytokine and cytokine receptor arrays, rat arrays, blood arrays, 
mouse stress arrays, neuroarrays, and the like, where some of these representative arrays are 
described in greater detail below. 

30 With respect to the polynucleotide probes that correspond to a particular type or kind 

of gene, type or kind can refer to a plurality of different characterizing features, where such 
features include: species specific genes, where specific species of interest include eukaryotic 
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species, such as mice, rats, rabbits, pigs, primates, humans, etc.; function specific genes, 
where such genes include oncogenes, apoptosis genes, cytokines, receptors, protein kinases, 
etc.; genes specific for or involved in a particular biological process, such as apoptosis, 
differentiation, cell cycle regulation, cancer, aging, proliferation, etc.; location specific 
5 genes, where locations include organ, such as heart, liver, prostate, lung etc., tissue, such as 
nerve, muscle, connective, etc., cellular, such as axonal, lymphocytic, etc. or subcellular 
locations, e.g. nucleus, endoplasmic reticulum, Golgi complex, endosome, lyosome, 
peroxisome, mitochondria, cytoplasm, cytoskeleton, plasma membrane, extracellular space; 
specific genes that change expression level over time, e.g. genes that are expressed at 

1 0 different levels during the progression of a disease condition, such as prostate genes which 
are induced or repressed during the progression of prostate cancer. 

The average length of the polynucleotides on the array is chosen to be of sufficient 
length to provide a strong and reproducible signal, as well as tight and robust hybridization. 
As such, the average length of the polynucleotides of the array will typically range from 

15 about 120 to 1000 nt and usually from about 120 to 800 nt, where in many embodiments, the 
average length ranges from about 200 to 700 nt, and usually 200 to 600 nt. The length of 
each polynucleotide on the array is less than the length of the mRNA to which it 
corresponds. As such, the polynucleotide represents only a fraction of the full length cDNA 
to which it corresponds. 

20 As mentioned above, the subject arrays typically comprise one or more additional 

spots of polynucleotides which do not correspond to the array type, i.e. the type or kind of 
gene represented on the array. In other words, the array may comprise one or more spots that 
are made of non "unique ' polynucleotides, i.e common polynucleotides. For example, spots 
comprising genomic DNA may be provided in the array, where such spots may serve as 

25 orientation marks. Spots comprising plasmid and bacteriophage genes, genes from the same 
or another species which are not expressed and do not cross hybridize with the cDN A target, 
and the \ikc y may be present and serve as negative controls. In addition, spots comprising 
housekeeping genes and other control genes from the same or another species may be 
present, which spots serve in the normalization of mRNA abundance and standardization of 

30 hybridization signal intensity in the sample assayed with the array. 



-10- 



WO 98/53103 



PCT/US98/10561 



Polynucleotide Probes of the Arrays 

Each spot of the pattern present on the surface of the substrate is made up of a unique 
polynucleotide probe composition. By "polynucleotide probe composition" is meant a 
collection or population of single stranded polynucleotides capable of participating in a 
5 hybridization event under appropriate hybridization conditions, where each of the individual 
polynucleotides may be the same - have the same nucleotide sequence™ or different 
sequences, for example the probe composition may consist of 2 different single stranded 
polynucleotides that are complementary to each other (i.e. the two different polynucleotides 
in the spot are complementary but physically separated so as to be single stranded, Le, not 

1 0 hybridized to each other). In many embodiments, the probe compositions will comprise two 
complementary, single stranded polynucleotides. 

In those polynucleotide probe compositions having unique polynucleotides, the 
sequence of the polynucleotides are chosen in view of the type and the intended use of the 
array on which they are present. The unique polynucleotides are chosen so that each distinct 

1 5 unique polynucleotide does not cross-hybridize with any other distinct unique polynucleotide 
on the array, i.e. the polynucleotide of any other polynucleotide probe composition that 
corresponds to a different gene falling within the broad category or type of genes represented 
on the array. As such, the nucleotide sequence of each unique polynucleotide of a probe 
composition will have less than 90% homology, usually less than 85 % homology, and more 

20 usually less than 80% homology with any other different polynucleotide of a probe 

composition of the array, where homology is determined by sequence analysis comparison 
using the FASTA program using default settings. The sequence of unique polynucleotides in 
the probe compositions are not conserved sequences found in a number of different genes (at 
least two), where a conserved sequence is defined as a stretch of from about 40 to 200 

25 nucleotides which have at least about 90% sequence identity, where sequence identity is 
measured as above. The polynucleotide will generally be a deoxyribonucleic acid having a 
length of from about 120 to 1000, usually from 120 to 700 nu and more usually 200 to 600 
nt The polynucleotide will not cross-hybridize with any other polynucleotide on the array 
under standard hybridization conditions. Again, the length of the polynucleotide will be 

30 shorter than the mRNA to which it corresponds. 
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Array Preparation 

The subject arrays can be prepared using any convenient means. One means of 
preparing the subject arrays is to first synthesize the polynucleotides for each spot and then 
deposit the polynucleotides as a spot on the support surface. The polynucleotides may be 
5 prepared using any convenient methodology, such as automated solid phase synthesis 
protocols, preparative PCR and like, where preparative PCR or enzymatic synthesis is 
preferred in view of the length and the large number of polynucleotides that must be 
generated for each array. 

For preparative PCR, primers flanking either side of the portion of the gene of 

10 interest will be employed to produce amplified copy numbers of the portion of interest. 

Methods of performing preparative PCR are well known in the art, as summarized in PCR, 
Essential Techniques (Ed. J.F. Burke, John Wiley & Sons)(i996). Alternatively, if a gene 
fragment of interest is cloned into a vector, vector primers can be used to amplify the gene 
fragment of interest to produce the polynucleotide. 

15 in determining the portion of the gene to be amplified and subsequently placed on the 

array, regions of the gene having a sequence unique to that gene should preferably be 
amplified. Different methods may be employed to choose the specific region of the gene to 
be amplified. Thus, one can use a random approach based on availability of a gene of 
interest. However, instead of using a random approach which is based on availability of a 

20 gene of interest, a rational design approach may also be employed to choose the optimal 
sequence for the hybridization array. Preferably, the region of the gene that is selected and 
amplified is chosen based on the following criteria. First, the sequence that is chosen should 
yield a polynucleotide that does not cross-hybridize with any other polynucleotide that is 
present on the array. Second, the sequence should be chosen such that the polynucleotide has 

25 a low probability of cross-hybridizing with a polynucleotide having a nucleotide sequence 
found in any other gene, whether or not the gene is to be represented on the array from the 
same species of origin, e.g. for a human array, the sequence will not be homologous to any 
other human genes. As such, sequences that are avoided include those found in: highly 
expressed gene products, structural RNAs, repeated sequences found in the sample to be 

30 tested with the array and sequences found in vectors. A further consideration is to select 

sequences which provide for minimal or no secondary structure, structure which allows for 
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optimal hybridization but low non-specific binding, equal or similar thermal stabilities, and 
optimal hybridization characteristics. 

The prepared polynucleotides may be spotted on the support using any convenient 
methodology, including manual techniques, e.g. by micro pipette, Inkjet, pins, etc., and 
5 automated protocols. Of particular interest is the use of an automated spotting device, such 
as the Beckman Biomek 2000 (Beckman Instruments). As mentioned above, the 
polynucleotide probe compositions that are spotted onto the array surface are made up of 
single stranded polynucleotides, where all the polynucleotides may be identical to each other 
or a population of complementary polynucleotides may be present in each spot. 

10 

Specific Array Types of the Subject Invention 

A variety of specific array types are also provided by the subject invention. As 
discussed above, array type refers to the nature of the polynucleotide probes present on the 
1 5 array and the types of genes to which the probes correspond. These array types include: 

human array; mouse array; cancer array, apoptosis array, human stress array, oncogene and 
tumor suppressor arrray, cell-cell interaction array, and cytokine and cytokine receptor array, 
as well as other types of arrays, e.g. rat array, rat stress array, blood array, mouse stress 
array, and nueroarray. Each of these arrays is described separately below. 

20 

Human Array 

One specific array type provided by the subject invention is the human array. In the 
human array of the subject invention, the majority of the spots on the array have a 
polynucleotide sequence corresponding to a human gene of interest. As such, all of the 

25 unique polynucleotide probes on the array correspond to human genes. The human genes 
represented on the human array are typically those genes that have been identified by those 
of skill in the art as key genes. By "key** is meant that the genes are relevant and related to 
the purpose of the array, e.g. the identification of difference in the expression profiles of 
different cell or tissue types, where the key genes are generally functionally important to the 

30 cell. In many embodiments, the genes represented on the human array are tightly regulated 
human genes. The term "tightly regulated gene 1 is used herein in accordance with its art 
accepted definition and use. As such, by tightly regulated human gene is meant a gene which 
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is not "leaky," as opposed to housekeeping genes which are generally expressed at similar 
levels in different cells and different tissues, i.e. a gene which is inducible such that in 
response to a specific inducing signal the gene turns "on" and when this signal is removed, 
the gene turns "off." 

5 In certain embodiments of the human array, human genes that may be represented on 

the subject arrays include: (a) oncogenes & tumor suppressors; (b) cell cycle regulators; (c) 
stress response proteins; (d) ion channel & transport proteins; (e) intracellular signal 
transduction modulators and effectors; (f) apoptosts-related proteins; (g) DNA synthesis, 
repair and recombination proteins; (h) transcription factors & general DNA binding proteins; 
10 (i) growth factor & chemokine receptors; Q) in^rleukin & interferon receptors; (k) hormone 
receptors; (1) neurotransmitter receptors; (m) cell surface antigens & cell adhesion proieins; 
(n) growth factors, cytokines and chemokines; (o) interleukins & interferons; (p) hormones; 
(q) extracellular matrix proteins; (r) cytoskeleton & motility proteins; (s) RNA processing & 
turnover proems; (t) post-translationai modification, trafficking & targeting proteins; (u) 
1 5 protein turnover; and (v) metabolic pathway proteins. 

In view of the length of the polynucleotides of the probe compositions of the spots, 
each polynucleotide of a probe composition typically has a nucleotide sequence of only a 
portion of the human gene. Specific sequences to which the polynucleotide sequence may 
correspond include those identified in Table 1 below, where by "correspond" is meant that 
20 the polynucleotide could have the same sequence as specified or a sequence complementary 
to the specified sequence. Whether the polynucleotide sequence is the same as a portion of 
the sense strand of the gene to which is corresponds or complementary thereto is based 
primarily on the nature of the target which the array is to be used, e.g. if the target is first 
strand cDNA, the polynucleotide will have a sequence found in the anti-sense DNA strand of 
25 the gene to which it corresponds. 

Of particular interest is a human array of the subject invention as shown in Fig. 1. In 
the array, each spot on the array comprises a known polynucleotide, as specified in Table I , 
where the array comprises spots which: (a) correspond to 588 different tightly regulated 
human genes; (b) comprise plasmid and bacteriophage polynucleotides; (c) comprise 
30 polynucleotides corresponding to housekeeping genes; and (d) genomic DNA. Each of the 
different types of polynucleotide spots are positioned at a known location on the membrane 
surface. 
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Position 


1410-1625 


391-586 


174-447 


1372-1594 


1990-2247 


181-436 


3886-4139 


3414-3904 


436-618 


3503-3856 


2232-2567 


2568-2880 


3359-3543 


920-1232 


1663-2125 


4382-4770 


11107-1473 


917-1208 


390-608 


216-459 


130-555 


607-879 


305-499 


947-1140 


2359-2823 


35-279 


345-730 


121-621 


338-595 


89-430 


2398-2575 


901-1232 


o 

CO 

m 

CM 


1981-2176 


690-964 


(1538-1878 


Gene Name 


inteiieukin*7 receptor (IL-7) I 


HUlFN-gamma interferon 


interleukin 7 (IL-7) 


c-myc oncogene 


interleukin-2 receptor 


interleukin-2 (IL-2) 


epidermal growth factor receptor (ERBB3) I 


insulin-like growth factor I receptor 


insulin-like growth factor II 


homo sapiens inducible nitric oxide synthase 


glutamate receptor subunit (GLUH1) I 


c-fms proto-oncogene 


tumor necrosis factor receptor 


p53-associated gene I 


platelet-derived growth factor B chain j 


transferrin receptor 1 


iinterieukin-1 precursor (PRE IL-1 ) I 


monocyte interleukin 1 (IL-1) 


[interleukin 3 (IL-3) 


[interleukin 4 (IL-4) 


interleukin BSF-2 (B-dell differentiation factor) 


Itumor necrosis factor 


[lymphotoxin (TNF-BETA) 


[T-cell surface glycoprotein T4 ! 


pnterleukin 6 receptor ! 


|T-cell replacing factor (interleukin-5) 


(interferon beta-1 (lFN-beta-1) 


I granulocyte-macrophage colony stimulating factor 


I transforming growth factor-alpha 


(leukocyte interferon (IFN-alpha) alpha-C 


transforming growth factor-beta (TGF-beta) 


(granulocyte colony-stimulating factor (G-CSF) 


1 nuclear factor kappa-B DNA binding subunit 


| nucleotide sequence of the c-myb cDNA clone lambda-LMC8 


p53 cellular tumor antigen 


(transforming growth factor beta-2 


GeneBank # I 


M29696 1 1 
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IM14743 I 
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1X04688 


IM28622 


IM11220 
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IJ00209 


X02812, J05114 
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IM19154.M22045.M22046 
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Table 1 (cont) 













































































i Position | 


i 41 64-4434 i 


2562-2740 I 


442-648 I 


3399-3748 | 


651-1116 


2641-2974 I 


555-959 I 


553-1012 | 


CO 
TT 
OJ 
CVJ 
t 

CVI 

o 

CVI 


934-1250 | 


1674-1006 I 


11879-2146 I 


1610-849 | 


[1281-1494 1 


[610-824 I 


11423-1740 I 


1508-921 I 


1817-1147 I 


11043-1452 1 


[885-1270 | 


I2297-2697 I 


I3358-3592 I 


(289-528 I 


I2448-2803 I 


I2762-3242 


I2442-2473 


11126-1363 


1958-1272 


11315-1633 


11873-2272 


13114-3536 


14236-4402 


1 858-560 


11262-1605 1 


572-976 


1449-1831 


Gene Name | 


kidney epidermal growth factor (EGF) precursor | 


interferon alpha receptor (HUIFN-ALPHA-REC) | 


interleukin 10 (IL10) | 


interleukin 2 receptor beta chain (P7Q-75) | 


interleukin 3 receptor (HIL-3RA) 


interleukin 4 receptor | 


interleukin 5 receptor alpha | 


interferon alpha/beta receptor | 


cytokine humig | 


cholinergic receptor nicotinic alfa polipeptide 3 | 


interleukin 2 receptor gamma polipeptide | 


androgen receptor | 


ciliary neurotropic factor receptor | 


adenosine receptor | 


interferon gamma receptor | 


erythropoietin receptor | 


Al adenosine receptor-adenyiate cyclase inhibitor | 


orphan hormone nuclear receptor 


adenosine receptor A3 


calcitonin receptor 


| coagulating factor II receptor 


|transforming growth factor beta receptor III 300 kDa 


| interleukin 9 receptor 


| interleukin 1 0 receptor I 


I nerve growth factor receptor 


| Vitronectin receptor alpha subunit 


CVJ 

c 

s 

CL 
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C 

*XJ 
a 

XI 

a 


I CCAAT-box DNA-binding protein Hap2 homolog 


1 retinoic acid receptor epsilon 


I B-myb 


(tyrosine kinase receptor tie 


[tyrosine kinase receptor FLT4 class III 


I helix-loop-heiix protein 1 R21 


| thrombomodulin 


basic transcription element-binding protein 2 


[basic transcription factor 62 kDa subunit 


GeneBank # 


X04671 I 


J03171 I 


M57627 I 


M26062 | 


M74782 


X52425 | 
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Position j 
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1855-2030 


2826-3144 


1587-1911 


1621-1886 


349-657 


3044-3302 


2686-3053 


1253-1475 


1676-2100 


1686-2063 
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359-765 


2621-2989 


1818-2029 
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1103-1398 


2607-3053 


1304-1736 


66-317 


96-341 


882-1057 


1449-1700 
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angiotensin ii type 1a receptor alt splice 1 I 


tyrosine kinase receptor HEK 


DNA-binding protein SMBP2 


global transcription activator I 


FMLP-related receptor I I 


transmembrane receptor rorl 


tyrosine kinase KDR receptor 


DNA-binding protein ICS 


thyroid hormone triiodothyronine receptor c-erbA ear-1 


CACCC-box DNA-binding protein 


tyrosine kinase receptor 


DNA-binding protein TAX I 


tyrosine kinase receptor TKT I 


prostaglandin E2 receptor I 


epidermal growth factor receptor 


platelet-activating factor receptor I 


tyrosine phosphatase receptor eph alt splice 1 I 


DNA-binding proiein/plasminogen activator inhibitor-1 regulator 


interferon gamma receptor I 


DNA-binding protein CN sterol regulating I 


v-erbA related ear-2 protein 


v-erbA related ear-3 protein 


i CCA AT displacement protein cut homolog alt splice 1 


Ibasic transcription factor 44 kDa subunit 


| DNA-binding protein APRF 


(estrogen receptor hSNF2b 


urokinase-type plasminogen activator receptor 


| Ets-lika gene 


(tyrosine kinase TRK-B receptor 


DNA-binding protein NFX1 cysteine-rich specific 


[ DP2 dimerization partner of E2F 


Iglia maturation factor beta 


I DNA-binding protein TAXREB67 


[cAMP-responsive element-binding protein 


tyrosine kinase EGF receptor Her4 


GeneBank# l ( 


HG1972 II 


HG2094 J 


HG209 1 
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HG244 I' 


HG2480 I 


HG2490 
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HG2811 i 


HG2869 I 
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HG3183 I 


HG3314 I 


L25124 I 


HG1187 I 


HG1662 I 
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IHG753 


HG859.L05515 
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1487-1845 


359-625 
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tyrosine phosphatase receptor gamma polypeptide j: 


DNA-binding protein PO-GA 
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interleukin 1 receptor I 


tumor necrosis factor receptor I 


macrophage-specific colony-stimulating factor (CSF-1 ) I 


insulin-like growth factor II receptor 


tyrosine kinase receptor HER2 I 


HM74 


HM89 


HM145 I 


hepatocyte growth factor activator precursor | 


hepatoma-derived growth factor I 


bone morphogenetic protein 4 I 


FER tyrosine kinase 1 


activation (Act-2) 1 


lendothelin ET3 I 


I neuroleukin 


|TEK tyrosine kinase receptor 


lendothelin receptor EDNRA 


lendothelin receptor EDNRB 


(interleukin 1L-1 3 


ICD40 liqand 


|CD27 ligand 


I cytokine receptor (EB13) 


I giial growth factor 2 (recombinant) 


[glial growth factor (recombinant) 


interleukin IL-14 


jthrombopoietin (MGDF/Mpl ligand) 


i insulin receptor 


uromodulin 


| RANTES pro-inflammatory cytokine 


| PDGF-alpha receptor 


iPDGF-beta receptor 


bone morphogenetic protein 1 
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567-997 1 


1458-1731 I 


243-704 | 


36-384 | 


747-1154 | 


833-1113 I 


511-837 I 


476-861 I 


320-584 | 


198-622 I 


680-1071 I 


713-1028 ] 


[1746-1967 I 


|1757-2152 I 


|602-847 | 


|205-397 I 


132-460 


|1453-1891 I 
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|1905-2146 I 


11549-1970 | 
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[1643-2015 


(2054-2328 


(3152-3421 


(40-342 


Gene Name 


bone morphogenetic protein 2a | 


bone morphogenetic protein 3 


macrophage inflammatory protein GOS1 9-1 | 


monocyte chemotactic and activating factor MCAF | 


neuronal growth protein GAP-43 | 


oncostatin M | 


amphiregulin AR | 


insuline-iike growth factor binding protein 1 | 


TNF-inducibie hyaiuronate-binding protein TSG-6 | 


heparin-binding vascular endothelial growth factor VEGF | 


insuline-like growth factor binding protein 2 
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bFGF receptor | 


glycoprotein gp1 30 


nerve growth factor HBNF-1 | 


i secreted protein I-309 | 


interleukin IL-11 


granulocyte colony-stimulating factor receptor G-CSFR1 I 


stem cell factor 


heparin-binding EGF-|ike growth factor | 


j HGF (hepatocyte growth factor) | 


keratinocyte growth factor | 


[brain-derived neurotrophic factor BDNF ] 


Igrowth/differentiation factor GDF-1 


|C5a anaphylatoxin receptor 


|T cell activation antigen CD27 


lendothelin ET2 


[interleukin 1L-1 2 (NKSFp40) 


linterleukin IL-12 (NKSFp35) 


[Fas antigen 


] interleukin 8 receptor alpha (IL8RA) 


I NMB-R (neuromedin B receptor) 


[hepatocyte growth factor-like protein 


| AXL tyrosine kinase receptor 


[lymphocyte activation antigen CD30 


thymosin beta-10 


GeneBank # 


M22489 I 


M22491 I 


M23452 I 


M24545 I 


M25667 I 


M27288 I 


M30704 I 


M31145 [ 
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M32977 I 


M35410 I 


M36717 l 
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M57230 | 
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Position 


1459-1748 | 


5090-1748 | 


1294-1712 | 


1760-1968 | 


1288-1604 I 


127-150 | 


m 

CO 
CO 
CVJ 

CO 
CVJ 


1053-1381 I 


1514-1799 


11362-1713 


OJ 
CO 

cp 
o> 

OJ 


1272-304 | 


|753-1189 | 


1 2007-2434 | 


[338-695 


333-740 


[1165-1559 


[280-613 


[37-430 | 


[16-254 | 


137-351 | 


|522-955 


[1810-2239 


[156-186 


|868-1173 


(1131-1502 


CO 

?f 

CVI 


[157-501 


[1098-1371 


[842-1244 


[198-605 


[230-533 


65-329 


329-657 


Gene Name 


connective tissue growth factor | 


tyrosine phosphatase receptor zeta-polypeptide | 


TDGF3 I 


RYK=related to receptor tyrosine kinase isolog [ 


VEGF receptor | 


Duffy blood group antigen (Fya-b+) j 


growth factor receptor tyrosine kinase STK-1 | 


interleukin 12 receptor component ] 


monocyte chemoattractant protein 1 receptor (MCP-1RA) alternatively 
spliced 


monocyte chemoattractant protein 1 receptor (MCP-1RB) alternatively 
spliced 


-o 

CO 

.£? 

CM 

W: 

CO 
UL 


endotheiiai-monocyte activating polypeptide 11 | 


keratinocyte growth factor receptor | 


cysteine protease CPP32 isom alpha 


interleukin IL-15 | 


activin type I receptor 


|VRP (vascular endotheiiai growth factor related protein) | 


[IFN-gamma-inducible chemokine 1P-10 ] 


s 

cn 
o 
o 
c 
o 
o 

2 
a. 

'SZ 

t 

o 


[MRP-14 (calcium binding protein in macrophages MIF-related) | 


[MRP-8 (calcium binding protein in macrophages MIF-reiated) | 


[platelet-derived growth factor A chain PDGF-A 


[leukemia inhibitory factor LIF 


linterleukin IL-9 (P40) 


[granuiocyte-macrophage colony-stimulating factor receptor GM-CSFRa 


[fibroblast growth factor FGF-1 


[nerve growth factor NGF-2 (same as NT-3) 


[macrophage inflammatory protein-2alpha (MIP2alpha) 


[PiGF (placenta growth factor) 


[interleukin 1 receptor type II 


|Cdw40 


[beta-thromboglobuiin-Iike protein 


neutrophil-activating peptide ENA-78 


rr 

CO 

a. 

x> 

c 
nj 
cn 

O 
X 

o 

CO 
CM 
CO 
CO 

IX 


GeneBank # 


M92934 | 


M93426 | 


CO 

to 

CD 
CO 

o> 


S59184 I 


U01134 | 


U01839 I 


U02687 I 


U03187 | 


OJ 

CO 
CO 
CO 

o 


U03905 


U04806 I 


U10117 I 


U11814 I 


U13737 I 


U14407 I 


U14722 


U43142 I 


X02530 | 


X06182 | 


[X06233 | 


|X06234 | 


1X06374 I 


1X13967 


1X17543 


1X17648 


[X51943 


|X53655 


[X53799 


IX54936 


[X59770 


1X60592 


1X72304 


X78686 
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Position | 


99-287 


1467-1817 I 


965-1175 I 


1069-1512 I 


867-1159 | 


2235-2577 | 


83-433 | 


o 

CO 
CO 
CVJ 

CM 


124-334 [ 


2132-2597 


706-980 | 


2367-2704 


00 

o 
a> 

o 
t 

XT 


13917-4258 | 


2810-3157 


11166-1481 


386-811 


1220-1599 


11416-1833 


983-1372 


178-392 


1772-2194 ] 


[842-1217 ' 


|708-1051 


438-762 


1113-1455 


1 355-674 


1107-555 


[2179-2631 


758-1184 


1818-1077 


|620-982 


CO 
CD 

ri- 

CM 
CO 
<D 


69-351 


(1897-2137 


348-736 


Gene Name 


monocyte-derived neutrophil chemotactic factor MDNCF 


protein kinase C delta-type | 


transcription factor E4TF1 -47 [ 


transcription factor E4TF1 -60 | 


recA-iike protein HsRad51 ( 


alpha-catenin | 


ld-1H I 


transcription factor AREB6 | 


DAD-1 I 


sky Sky 


BST-1 I 


ZFM1 protein I 


ZFM1 protein alternatively spliced product 


transcriptional activator hSNF2a I 


LIMK (LIM kinase) ] 


S 

Q 


; DNA-binding protein TAXREB302 


intercellular adhesion molecule- 1 (ICAM-1) I 


[transforming growth factor-beta 3 (TGF-beta3) 


erythroid differentiation protein (EDF) 


Isialophorin (CD43) 


[excision repair protein ERCC6 


[MAP kinase kinase 


| replication factor C 36-kDa subunit 


I replication lactor C 38-kDa subunit 


jachaete scute homologous protein (ASH1) 


[ moesin-ezrin-radixin-like protein 


Kruppei related zinc finger protein (HTF1 0) 


(focal adhesion kinase (FAK) 


jactivated p21cdc42Hs kinase (ack) 


|TR3 orphan receptor 


[transcription factor RZR-alpha 


[tight junction (zonula occludens) protein ZO-1 (tumor suppressor) 


|c-myc transcription factor (puf) 


|NF-kappa-B transcription factor p65 subunit 


natural killer cell enhancing factor (NKEFB) 


GeneBank # 1 


Y00787 


D10495 | 


D13316 J 


D13318 i 


D13804 I 


D13866 I 


ca 

00 
CO 
CO 

T— 

O 


D15050 I 


D15057 | 


D17517 


D21878 ! 


D26120 I 


D26121 I 


D26155 I 


D26309 I 


D28118 | 


[D28468 I 


J03132 I 


J03241 I 


IJ03634 I 


IJ04536 | 


IL04791 I 


IL05624 I 


IL07540 ! 


IL07541 


IL08424 


1111353 


IL11672 


IL13616 


1L13738 


IL13740 


[L14611 


co 

03 
— 1 


IL16785 


i 
| 

i 
t 

is 

i 


L19185 
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Position 


113-338 


1374-1538 


89-305 


2534-2802 


163-671 


3275-3583 


1677-2107 


3861-4236 


1073-1376 


227-278 


489-780 


969-1220 


496-722 


355-573 


415-729 


970-1283 


I454-755 


|115-354 | 


1221-1514 


262-726 


824-1120 


461-695 


367-667 1 


740-1071 


767-1106 


885-1141 


538-864 


288-591 


255-495 


59-310 


56-269 


322-621 


447-672 


1756-2180 


12804-3086 


(312-693 


Gene Name 


paired box homeotic protein (PAX8) I 


ERCC5 excision repair protein 1 


protein serine/threonine kinase stkl I 


protein serine/threonine kinase stk2 I 


14-3-3n protein | 


octamer binding transcription (actor 1 (OTF1 ) I 


S protein I 


plasma membrane calcium ATPase isoform 2 (ATP2B2) I 


guanine nucleotide regulatory protein (G1 3) I 


Bax beta I 


Rad I 


calcium/calmoduiin dependent protein kinase I 


CTLA4 counter-receptor (B7-2) I 


GRB2 isoform I 


nuclear factor l-X I 


protein kinase (MLK-3) I 


CDC2-re!ated protein kinase (PISSLRE) I 


c 

XI 

vs 

in 

a! 
k. 

2 

u 

JS 
c 

CO 

c 
o 

(D 
CD 

CO 

£ 

o 
cx 

< 

cc 


I autocrine motility factor receptor (AMFR) I 


mas proto-oncogene , 


[guanine nucleotide-binding protein G-s alpha subunit partial cds 


| MAL protein 


I homeobox d protein 


(differentiation antigen (CD1 9) 


| protein kinase C aipha-polypeptide (PKCA) 


Idifferentiatton antigen (CD33) 


[prothymosin alpha (ProT-alpha) 


I GTP-binding protein (RAB3A) 


jGTP-binding protein (RAB4) 


jGTP-binding protein (RAB6) 


I GTP-binding protein (RAB2) 


| GTP-binding protein (RAB3B) 


| GTP-binding protein (RAB5) 


MUC1 8 glycoprotein 


[stem cell protein (SCL) 


myelobiastin 


GeneBank # ]' 


L19606 1 1 


L20046 |i 


L20320 || 


L20321 l| 


L.20422 I 


L20433 I 


L20815 I 


L20977 I 


L22075 I 


L22474 I 


L24564 I 


L24959 I 


125259 


L29511 


L31881 I 


IL32976 I 


IL33264 I 


IL34587 I 


[L35233 


M13150 


IM14631 I 


M15800 I 


IM16937 


M21097 


M22199 


M23197 


IM26708 


IM28210 


1M28211 


IM28212 


IM28213 


M28214 


to 

T— 

CM 
CO 
CM 
Z> 


M28882 


1M29038 


M29142 
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Position 


1056-1450 


2098-2549 


2340-2764 


2285-2631 


2277-2685 


316-636 I 


2109-2466 


1729-1974 


53-354 


942-1299 


433-780 


561-843 


1226-1539 


443-789 


58-338 | 


CM 
T 

to 


950-1211 


893-1187 


963-1224 ! 


1158-1408 j 


740-1027 ! 


3626-3902 


640-668 


989-1276 


1018-1410 


1498-1838 


5680-6055 


2935-3238 


3197-3483 


175-433 


11205-1507 


709-990 


1209-1552 


706-1053 


407-769 


I3676-3984 


Gene Name 


vascular cell adhesion molecule 1 


endothelial leucocyte adhesion molecule I (ELAM1 ) 


Ku (p70/p80) subunit I 


papillary thyroid carcinoma-encoded protein 


transcription factor (E2A) j 


cyclic AMP response element-binding protein (HB16) 3* end j 


DNA repair heiicase (ERCC3) j 


Ku protein subunit I 


cell adhesion protein (SQM1 ) 


N-cadherin 


active transcription factor CREB j 


transcription factor !ID I 


DNA-repair protein (XRCC1 ) 


transducin beta-2 subunit I 


transducin beta-1 subunit 3* end 


| lymphoid-specif ic transcription factor I 


sequence-specific DNA-binding protein (AP-2) I 


h-pim-1 protein (h-pim-1) 


| B cell differentiation antigen 


| cell adhesion molecute (CD44) 


[neurofibromatosis protein type I (NF1) 


Icolorectal mutant cancer protein ! 


[mitochondrial transcription factor 1 


Itranscription factor ETR103 


transcription factor ETR101 


(replication protein A 70kDa subunit 


bullous pemphigoid antigen 


(transcriptional enhancer factor (TEF1) DNA 


{cell adhesion molecule L1 (L1CAM) 


|HHR6A (yeast RAD 6 homologue) 


Idipeptidyl peptidase IV (CD26) 


| sulfated glycoprotein-2 3'end 


(homeobox protein (HOX-1 1) 


LU 
LL. 

s. 

C 

*5 

2 
a. 
a> 
c 
~o 
c 
la 
< 

Q 


transcription factor (TFIIB) 


|HEB helix-loop-helix protein (HEB) 


GeneBank # 


M30257 I 


M30640 I 


M30938 |l 


M31213 Jl 


M31523 I 


M31630 t 


M31899 I 


M32865 I 


M33374 ] 


M34064 I 


M34356 [ 


M34960 I 


M36089 


M36429 I 


M36430 | 


M36542 I 


IM36711 I 


IM54915 I 


IM54992 I 


IM59040 I 


[M60915 I 


M62397 I 


M62810 ! 


IM62829 ! 


IM62831 


IM63488 


M63618 


IM63896 


IM74387 


IM74524 


IM74777 


IM74816 


IM75952 


1M76541 


M76766 


M80627 
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Position 1 


227-593 


549-873 


113-408 


946-1158 


790-1099 


643-1135 


882-1286 


LO 
LO 
CO 

CO 

cn 


1057-1520 


1718-1945 


892-1271 


728-1118 


1507-1752 


701-1068 


2446-2771 


1921-2226 


1091-1450 


1128-413 


98-409 I 


1079-1323 I 


1852-2185 


1430-1701 


2231-2569 


5023-5369 


1630-2062 


2039-2444 


1358-1681 


1212-1556 


125-538 


cn 

CO 
CO 

CM 
O 

o 

f 


1236-1522 


cn 

CO 

o 

CVJ 
CM 

o 
r>. 


CO 
CO 
CO 
CM 
* 

CO 

o 
o 

CVJ 


930-1207 


CO 

cvi 
o 


11828-2140 


Sene Name 


transcription elongation factor (Sll) |: 


mveloid cell nuclear differentiation antigen |; 


S19 ribosomal protein I 


NRL qene product I 


nuclease-sensitive element DNA-bind»ng protein I 


retinoid X receptor beta (RXR-beta) 


replication factor C 40-kDa subunit (A1 ) I 


replication factor C 37-kDa subunit 


IFN-responsive transcription factor subunit I 


homeobox 21 protein (HOX2A) I 


zinc finger transcriptional regulator 


FLU I 


follicle stimulating hormone receptor I 


nudeobindtn precursor | 


B-cell specific transcription factor (BSAP) I 


MAR/SAR DNA binding protein (SATB1) I 


(region 7) homeobox protein (HOX7) I 


5HT1 a=5-hydroxytryptamine receptor {transmembrane regions 5 and 6} I 


(transmembrane 4 superfamily protein (SAS) 


I guanine nucleotide regulatory protein (NET1) 


iquanine nucleotide regulatory protein (timi) j 


[clone ndf43 neu differentiation factor i 


|PAX3/forkhead transcription factor fusion 


ITFIIIC Box B-binding subunit 


[alpha palindromic binding protein 


Itumor suppressor (LUCA-1) 


Itranscription factor LSF 


Idioxtn-inducibfe cytochrome P450 (CYP1B1) 




IFUSE binding protein 


|p55CDC 


|cionepSK1 interferon gamma receptor accessory factor-1 (AF-1) 


(voltage-gated calcium channel beta subunit 


Imutant lymphocyte-specific protein tyrosine kinase (LCK) 


(amphiphysin 


epidermal growth factor receptor substrate (eps15) 


GeneBank # I 1 


M81601 1 


M81750 i 


M81757 |! 


M81840 I 


M83234 i 


M84820 


M87338 I 


M87339 I 


M87503 I 


M92299 I 


M92843 


M93255 I 


M95489 


M96824 


M96944 


M97287 I 


IM97676 I 


S64045 


U01160 


U02081 I 


U02082 


U02326 


U02368 


U02619 


U02683 


U03056 


U03494 


U03688 


U04847 


IU05040 


U05340 


IU05875 


U07139 


U07236 


U07616 


U07707 
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Position 


2735-3130 


2039-2374 


4657-4920 


1575-1928 


487-833 


1745-2063 


3556-3892 


967-1380 


2901-3146 


132-492 


2293-2645 


851-1218 


CM 
CO 
CM 

CO 

CM 
CM 


I665-942 


479-759 


1090-1403 


1816-2118 


3721-4093 


4002-4343 


2354-2662 


2224-2652 


2338-2646 


1866-2254 


1097-1383 


1970-2245 


749-1113 


543-856 


638-1008 


2138-2411 


2819-3135 


2123-2483 


123-427 


289-589 


3745-4027 


O 

3 

O 
CM 


(2514-2779 


Gene Name 


contactin 1 precursor (CNTN1) 1 


NF-ATc 1 


R kappa B 


transcription factor LCR-F1 I 


serine kinase 


melanoma differentiation associated (mda-6) I 


JAK family protein tyrosine kinase JAK3 I 


nuclear factor NF45 I 


nuclear factor NF90 I 


HOX A1 homeodomain protein (HOXA1) 


epidermal growth factor receptor kinase substrate (Eps8) 


positive regulator of programmed cell death ICH-1L (lch-1) 


homolog of Drosophiia discs large protein isofonm 1 (hdlg-1) I 


(ard-1) j 


LIM domain transcription factor LlM-1 (hLIM-1) I 




transcription factor IL-4 stat I 


DNA polymerase alpha-subunit 


IX chromsome CCG1 protein inv in cell proliferation 


| ski oncogene 


| sno oncogene snoN protein ski-related 


N-CAM (a nontransmembrane isoform) from skeletal muscle | 


| Wilms tumor WT1 zinc finger protein Krueppel-like 


| GATA-3 transcription factor 


|P120 antigen 


|ZFX put transcription activator isoform 1 


jproliferation-associated gene (pag) 


(MacMarcks 


ITRKE 


I desmoglein 2 


| angiotensin-converting enzyme (ACE) 


(relaxin H2 


renin-binding protein 


giutamate receptor type 1 subtype 5a 


[glucagon 


giutamate receptor 5 


GeneBank # ; 


U07819 I- 


U08015 I 


U08191 II 


U08853 |i 


U09564 J 


U09579 i 


U09607 I 


U10323 I 


U10324 I 


U10421 I 


U12535 ! 


U13021 


U13897 I 


U14575 J 


U14755 I 


U15979 I 


IU16031 I 


X06745 


X07024 


X15218 I 


X15219 j 


X16841 


1X51630 


X55122 


X55504 


1X59738 


1X67951 


1X70326 


1X74979 


1226317 


A00914 


IA06925 


D10232 


ID28538 


J04040 


L19058 
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Position 


B28-1183 


57-257 


2412-2783 


1226-1521 


451-744 


CO 
CO 

°9 
cn 

00 


CM 

CJ5 

<r™ 

cn 

05 
LO 


938-1239 


1093-1425 


853-1140 


o 

CO 
r» 

CM 


QO 

cn 

T— 

CM 
CO 
CO 


537-894 


563-855 


OO 

T— 

CM 
r— 

CO 
I s - 

00 


2388-2796 


12459-2883 


241-546 


1520-1821 


172-455 


1704-1989 


719-1057 


CM 

to 

CO 

o 

rr 
CM 
CO 


3427-3784 


o 

CM 
rr 

CM 


625-938 


504-906 


203-511 


257-583 


3932-4284 


893*1189 


1357-1605 


290-572 


454-745 


1365-1657 


1412-676 


Sene Name 1 


nhibin A-subunit H 


iiazepam binding inhibitor 1 


3eta-2-adrenergic receptor |l 


jopamined2 receptor [' 


growth hormone-dependent insulin-like growth factor-binding protetn 


retinoic acid-binding protein II 1 
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Mouse Array 

In the mouse array according to the subject invention, all of the unique 
polynucleotide probe compositions will correspond to a mouse gene of interest. Mouse genes 
that are represented on the array are key genes, by which is meant that they have been 

5 reported to play primary roles in a variety of different biological processes. Typically the 
mouse genes represented on the array are genes that are under tight transcriptional control. 
Genes of interest that may be represented on the array include: oncogenes, cell cycle genes, 
apoptosis genes, growth factor genes, cytokine genes, interleukin genes, receptor genes, and 
genes associated with different stages of embryonic development. 

1 0 In certain embodiments, of particular interest is an array having the following types 

of genes represented on its surface: oncogenes & tumor suppressors; cell cycle regulators; 
stress response proteins; ion channel & transport proteins; intracellular signal transduction 
modulators & effectors; apoptosis-related proteins; DNA synthesis, repair & recombination 
proteins; transcription factors & general DNA binding proteins; growth factor & chemokine 

1 5 receptors; interleukin & interferon receptors, hormone receptors; neurotransmitter receptors; 
cell-surface antigens & cell adhesion proteins; interleukins & interferons; cytoskeleton & 
motility proteins; and protein turnover. In a specific mouse array of interest, the spots are as 
listed in Table 2. 

20 The mouse array of the subject invention finds use in a variety of different 

applications, where such applications include: profiling differential gene expression in 
transgenic knockout mice or other experimental mouse models; investigating processes such 
as embryo genesis and tumorigenesis; discovering potential therapeutic and diagnostic drug 
targets; and the like. 
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franscription factor TF If D | 


10- 1 ; Tight junction protein; discs-large family member, partially 
nomologous to a dlg-A tumor suppressor in Drosophila/ 


ERCC5 excision repair protein; DNA-repair protein complementing XP-G 
sells (XPG) 


Bax; Bcl-2 heterodimerization partner and homologue | 


B7-2; T lymphocyte activation antigen CD86; CD28 antigen ligand 2 # B7-2 
antigen; alternative CTLA4 counter-receptor 
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PKC-alpha; protein kinase C alpha type | 


CD44 antigen I 


T-lymphocyte activated protein j 


Neuronal-cadherin (N-cadherin) I 


ATP-dependent DNA helicase It 70 kDa subunit; thyroid Ku (p707p80) 
autoantigen p70 subunit; p70 Ku) 


G13; G-alpha-13 guanine nucleotide regulatory protein ] 


Transcription factor RelB ' 


Vascular cell adhesion protein 1 


ERCC3 DNA repair helicase; DNA-repair protein complementing XP-B ceils 
(XPBC) 


|CRE-BP1; cAMP response element binding protein 1 


|XRCC1 DNA-repair protein, affecting ligation 


(Nuclear hormone receptor ROR-ALPHA-1 


cd 
c5 
c 
"5 

2 
a. 

CO 

CO 
1 


Prothymosin alpha 


|PAX-8 (paired box protein PAX 8) 


CamK IV; Ca2/caimodulin-dependent protein kinase IV (catalytic chain) 


ATP-dependent DNA helicase 11 80 kDa subunit; thyroid Ku (p70/p80) 
autoantigen p80 subunit; p80 Ku) 


|Ret proto-oncogene (Papillary thyroid carcinoma-encoded protein) 


Nm23-M2; nucleoside diphosphate kinase B; metastasis-reducing protein; 
c-myc-related transcription factor 


GenBank # h 


ID01034 T 


D14340 i 


D16306 ( 


IL22472 ]! 


L26606 


L27105 


IM13945 | 


IM20157 


IM25811 I 


IM27129 | 


IM31042 J 


IM31131 I 


M38700 


IM63660 I 


[M83380 ! 


IM84487 


to 

CO 
r— 


IS76657 


IU02887 


IU53228 


IU57311 


1X56135 


1X57487 
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LO 

X 
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CO ; CM CO 
CMi r-> CO 
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i 
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850-1113 | 


515-744 


404-709 I 


1592-1873 | 


3123-3426 | 


1317-1691 | 


2057-2411 I 


1205-1488 | 


723-1062 I 


647-884 | 


2153-2554 | 


1262-1563 I 


262-504 j 


80-357 I 


288-629 I 


832-1089 


I 

co 
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I160M910 1 


11101-1380 I 
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I 

to 
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CO 

in 

CO 
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CO 

u. 
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CD 
rr 
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CJ 
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D4a ] 


D7n | 
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! 
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™ 

Q 


|D7a 


CD 

CVJ 

o 


B2c 


A7g 


|E2j 


rr 
Q 


E7I 


|D3f 


Gene Name 


MAPKK6; MAP kinase kinase 6{dual specificity) (MKK6) j 


DNA polymerase alpha catalytic subunit (p180) | 


Caspase-3; Nedd2 cysteine protease (positive regulator of programmed 
cell death ICH-1 homoiogue) 


PSD-95/SAP90A | 


Angiotensin converting enzyme (ACE) (clone ACES.) 


Clusterin; complement lysis inhibitor; testosterone-repressed prostate 
message 2; apotipoprotein J; sulfated glycoprotein^ 


Adipocyte differentiation-associated protein | 


Epidermal growth factor receptor kinase substrate EPS8 | 


Jak3 tyrosine-protein kinase; Janus kinase 3 \ 


DesmocoHin 2 I 


Stat6; signal transducer and activator of transcription 6; IL-4 Stat; STA6 | 


3 

0) 
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CO 
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'Sc 
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2 

CL 
QJ 
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*« 
g 
>» 
o 

*u 
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Q. 
W 

CJ 

o 

sz 

CL 

B 

2 


ERA-1 Protein (ERA-1 -993) | 


Homeo Box protein 2.1 (Hox-2.1) | 


Zinc finger X-chromosomal protein (ZFX) | 


WT1 ; Wiims tumor protein; tumor suppressor 


Tristetraproiine | 


Nucleobindin 


PAX-5 (B cell specific transcription factor) | 


IFNgR2; interferon-gamma receptor second (beta) chain; interferon gammaj 
receptor accessory factor-1 (AF-1) I 


Transcriptional enhancer factor 1 (TEF-1) 


| Transcription factor NFAT 1, tsoform alpha 


|DNA-binding protein SATB1 


CCHB3; calcium channel (voltage-gated; dihydropyridine-sensitive; L-type) 
beta-3 subunit) 


p57kip2; cdk-inhibitor kip2 (cyclin-dependent kinase inhibitor 1B) member 
of the p2lCiP1 Cdk inhibitor family; candidate tumor suppressor gene 


IsnoN; ski-related oncogene 


(Homeo Box protein 7.1 (Hox-7.1) 


I Neuronal cell surface protein F3 


GATA-3 transcription factor 


GenBank # |i 


1X97052 |i 


[D17384 |i 


D28492 i 


ID50621 ll 


J04946 


L08235 


IL12721 I 


IL21671 j 


[L33768 | 


IL33779 I 


IL47650 I 


|M12056 | 


JM22115 I 


IM26283 | 


IM32309 I 


IM55512 


IM57422 ! 


IM96823 


IM97013 


S69336 


S74227 
U02079 


|U05252 


U20372 


U20553 


|U36203 


[X14759 


1X14943 


X55123 
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550-873 


51-294 J 


267-623 I 


1225-1477 


52 

up 
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CM 


51-392 


51-365 I 


1673-1934 


1051-1357 


520-870 J 


62-369 


CO 

■rr 
in 


13-263 


878-1145 I 


1042-1320 


452-758 


276-571 | 


781-1074 | 


484-790 I 


400-707 | 


1858-2205 | 


1082-1309 


606-986 I 


1189-484 | 


1729-2064 I 


[810-1117 I 


115-377 | 


(474-719 I 


(70M009 I 


[172-531 | 


101-363 


|658-957 I 


(288-589 I 


Array Coordinate 1 
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E7f |l 


X3 
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C7k 


C4i | 
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CO 
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E 
m 
O 


C5n I 


C2d 


c 

a 


C2b 


A4k | 


A3k 


A4n 


D6e J 


A6g 


A6h | 


CD 
UJ 
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a> 
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5 


S 
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e 


cn 

CM 

U. 


Clm 


|F2k 


|A6a 


|A5i 


ca 


JZ 
GQ 


E7n 


CO 

z 

s 


fB1 DNA binding protein |i 


Dipeptidyl peptidase iv I i 


Fti-1 ets-reiated proto-oncogene I- 


RXR-beta cis-1 1 -retinoic acid receptor 


C3H cytochrome P450; Cyp1 b1 I 


Ubiquitin-conjugating enzyme, yeast Rad6 homologue; murine HR6B 


Relaxin t 


Transcription factor LiM-1 i 


DNA topoisomerase I (Top I) I 


DNA topoisomerase li (Top 11) 1 


GST Pi 1; glutathione S-transferase Pi 1; preadipocyte growth factor 


Glutathione S-transferase A I 


Glutathione S-transferase Mu 1 


c-Abi proto-oncogene I 


0) 

c 

q> 

CJ 

c 
9 

o 

CL 

cr 
< 


c-Src proto-oncogene 


Rettnoic acid binding protein II ceilutar (CRABP-II) | 


Cvclin 02 (G1/S-specific) 


Cvclin D3 (G1/S-specific) I 


i 5-Hvdroxytryptamine receptor [Serotonin receptor type 2 (5HT2)] I 


ICvclin 01 fG1/S-specific) ! 


Pur-aipha transcriptional activator; sequence-specific ssDNA-binding 
protein 


Cdc25a; cdc25M1; MPI1 (M-phase inducer phosphatase 1) 


I ERCC-1; DNA excision repair protein 


lc-rel proto-oncogene 


| Inhibin alpha subunit 


| Glutathione reductase 


I Insulin-like growth factor binding protein-3 (IGFBP-3) 


I Cyciin A (G2/M-specific) 


iPreproglucagon 


NF-kB p65; NF-kappa-B transcription factor p65 subunit; rel-related 
polypeptide 


I PKC-theta; protein kinase C theta type 


|VLA-3 alpha subunit 


[GenBank* H 


X57621 


X58384 I 


X59421 i 


X66224 I 


1X78445 |i 


X96859 1 


IZ2708B I 


Z27410 


1D10061 I 


D12513 


030887 


IJ03958 I 


IJ04696 


IL10656 I 


M13071 


M17031 


IM35523 I 


M83749 ! 


U43844 


S49542 


S78355 


U02098 


U27323 


1X07414 


X15842 


1X69618 


1X76341 


1X81581 


Z26580 


Z46845 


cr> 
o 
o> 

5 
2 


ID11091 


01 3867 
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Table2(cont) 



Position I 


326-605 


273-523 


1816-2159 


1060-1429 


737-964 


2399-2713 


144-434 


641-906 


3262-3450 


538-750 


1-233 


255-551 


1838-2050 


4127-4476 


1893-2200 


1437-1723 


245-500 


645-984 


168-488 


o 

CO 

r- 
cS 


228-516 l 


1 1585-1 902 


1225-1466 


915-1230 


1945-2223 


109-427 


8989-9170 


607-834 


135-437 1 


Array Coordinate 1 


C4a ! 


in 
< 


A7h | 


C4g | 


Ol 

3 


E7e 


CO 

O 


O 


2 


CN 

tn 
CO 


B7k | 


Bid | 


CO 

CO 


Ala | 


A7k 1 


B6j 1 


D1a I 


c 
CQ 


CO 
CO 

O 


B5m | 


A7d ! 


CM 

Q 


C3c 


C4i 


o 


B5c 


cn 
tn 
O 


A1e 


je 

CQ 


Gene Name 


NADPH-cytochrome P450 reductase | 


Beta-protachykinin a | 


Wee1/p87; cdc2 tyrosine 15-kinase | 


Protein tyrosine phosphatase | 


O 

o 

c 
g 

a. 
*c 
o 
w> 
c 

2 

-a 
a> 
as 
a> 

.3. 

6 

9 

c 


Integrin alpha 7 


Gadd45; growth arrest and DNA-damage-inducibie protein I 


Bci-xL apoptosis regulator (bcl-x long); Bc!-2 family member 


N-myc proto-oncogene protein j 


cAMP-dependent protein kinase type i-beta regulatory chain | 


IRF1; interferon regulatory factor 1 | 


HSP86; heat shock 86kD protein | 


LFA1 -alpha; integrin alpha L; leukocyte adhesion glycoprotein LFA-1 alpha 
chain; antigen CD11A (p180) 


APC; Adenomatous Polyposis Coli protein | 


Cdc25b; cdc25M2; MP 12 (M-phase inducer phosphatase 2) | 


PI3-K p1 10; phosphatidylinositoi 3-kinase catalytic subunit 


HSP27; heat shock 27kD protein 1 | 


Csk; c-Src-kinase and negative reguiator 


Fasl; Fas antigen ligand; generalized tymphoproliferation disease gene 
(gld) in mice 


IMAPK; MAP kinase; p38 ' | 


pi 9ink4; cdk4 and cdk6 inhibitor 


ElM Ets family transcription factor 


CRAF1; TNF receptor (CD40 receptor) associated factor; TRAF-related 


SPI3; serpin; similar to human proteinase inhibitor 6 (placental thrombin 
inhibitor) serine proteinase inhibitor 


pP cell death protein; Fas/APO-1 (CD95) interactor, contains death 

idomain 


SLAP; src-like adapter protein; Eck receptor tyrosine kinase-associated 


I Atm; ataxia teiangtectasia murine homologue 


EB1 APC-binding protein 


(TANK; l-TRAF; TRAF family member associated NF-kB activator 


GenBank # 


D17571 


D17584 


D30743 


D83966 


J05205 


L23423 


|L28177 | 


L35049 


X03919 


IM20473 I 


(M21065 j 


M36830 


CO 

o 
to 
2 


|M88127 j 


|S93521 | 


|U03279 | 


[U03560 ] 


U05247 


U06948 


U10871 


U 19597 


IU19617 


U21050 


U25844 


tn 

CD 

cn 
in 

CM 
3 


U29056 


U43678 


U51196 


IU51907 



WO 98/53103 



PCTAJS98/10561 



Table 2 (cont) 
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00 
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▼* 
i 

cn 
o 
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183-406 


1892-2179 


548-857 
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co 
in 
tn 

55 

CM 
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1366-1706 


1305-1538 
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1432-1459 


■*— 
tn 

r™ 
r— 
t 

CO 

tn 

CO 


942-1276 


1571-1812 

i 


1184-1447 


261-611 


1422-1711 


141-362 


799-1140 


719-987 


617-844 


656-916 


4193-4508 


m 
tn 
cn 

CO 

E 


542-807 


1990-2320 


1159-417 | 


<J> 

CO 
O 

o 

CO 


1146-1442 


o> 

CO 

i 

CO 


far ay Coordinate 1 


21a J 


CO 

O 


cd 
cn 


CO 

LLI 


in 
< 


CO 

<c 


E5n 


A4m 


E7b 


CO 
CQ 


B1b 


A5b 




A1f 


A6c 


E7d 


Ui 


D4h 


<3 
< 


|B5n 


A3d 


A6b 


Aid 


C6i 


to* 

O 




|C6n 


A6k 


B3f 


CO 
LU 


3ene Name i 


:aspase-1 1 ; ICH-3 cysteine protease; upstream regulator of ICE < 


vlLH1 DNA mismatch repair protein; MutL homologue H 


nsulin-like arowth tactor-IA |l 


Dell surface qlvcoprotein MAC-1 alpha subunit I 


c 

*C0 

2 

Q. 

O) 

c 

o 
c 

hS 
c 

Ctf 
Ol 

o 
o 

§ 

o 
o 
a 

? 


_-mvc oroto-oncogene protein I 


DD18 antiaen beta subunit (leukocyte adhesion LFA-1) (CD3, P150 t 95) I 


c-Fqr proto-oncogene | 


Inteorin aloha 4 I 


PKC-beta; protein kinase C beta-ll type 


HSP60; heat shock 60 kDa protein 1 (chaperonin, GroEL homologue); 
mitochondrial matrix protein P1 


c-Cbi proto-oncoqene {Adaptor protein) I 


Cdc25 Dhosohatase; guanine nucleotide releasing protein I 


Ezrin; Viltin 2; NF-2 (merlin) related filament/plasma membrane associated 
orotein 


Cvclin Bl (G2/M-specific) I 


inteqrin alpha 6 


5-Hvdroxytrvptamine (serotonin) receptor 3 I 


Homeobox protein HOXD-3 ! 


ICvclin E {G1/S-specific) 


1 MAPKAPK-2; MAP kinase-activated protein kinase; MAPKAP kinase 2 


Fra-2 (fos-related antigen 2) 


ICvclin A1 (G2/M-specific) 


DCC; netrin receptor; immunoglobulin gene superfamlly member; former 
tumor suppressor protein candidate 


MHR23A; Rad23 UV excision repair protein homologue; xeroderma 
pigmentosum group C (XPC) repair complementing protein 


MHR23B; Rad23 UV excision repair protein homologue; xeroderma 
Diqmentosum group C (XPC) repair complementing protein 


integrin beta 


lMmRad52; veast DNA repair protein Rad52 homologue 


"o 

<D 
CL 
U> 

CD 

CD 
c 

O 


Prostaqlandin E2 receptor EP4 subtype 


lnterleukin-5 receptor 


GenBank # X 


U59463 < 


U59883 f 


X0448G t 


XG7640 < 


X13664 I 


X13945 I 


X14951 i 


X52191 i 


X53176 


X53532 


X53584 


X57111 


X59868 


X60671 


X64713 


X69902 


X72395 i 


X73573 


X75888 


X76850 


X83971 


X84311 


X85788 


X92410 


X92411 


Y00769 


Z32767 


237110 

D13458 


D90205 
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545-809 I 
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i 
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* h; 
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1425-1844 
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*ene Name ^ 


GF-i-R alpha: insuiin-like orowth factor I receptor alpha subunit C 


GFR li; insulin-like growth factor receptor li, cation-independent mannose- 
i-P receDtor: elevated in Wilms's tumor cells < 


tots: APRF: acute ohase response factor |i 


Calcitonin receptor 1b I' 


=nriotheiin b receDtor fEdnrbl I 


CO 

k 

OJ 

s 
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XJ 

c 

O) 

2 
a 

CL 


Pre-olatelet-derived arowth factor receptor I 
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Thrnmhomodulin 1 


intPfieukin 6 (B ceil differentiation factor) I 


Androaen receDtor . _ J 


Rnnfi morohoaenetic orotein 4 (BMP-4) (TGF-beta family) 1 


Transferrin receotor protein (p90, CD71) ] 


Transformina arowth factor beta 2 I 


Glutamate receptor, ionotropic AMPA 1 ! 


ITNF 55: tumor necrosis factor 1 (55kd) , 


C 

g 

UL 

cn 
e 

'3 

CT 
OJ 

CO 

in 
a 

CM 

e 


iTranscriotion factor 1 for heat shock gene 


CD40L; CD40 lioand 


c-Fms proto-oncogene {macrophage colony stimulating factor 1 (CSF-1) 

receotort 


B-mvb Droto-oncoaene; myb-reiated protein B 


lEar-2: v-erbA related oroto-oncogene 


Tie-1 tvrosine-orotein kinase receptor 


(Glutamate receDtor, ionotropic NMDA2B (epsilon 2) 


Glutamate receptor, ionotropic NMDA2A (epsilon 1) 


CD7 antioen 


[TranscriDtion factor S -II (transcription elongation factor) 


I Basic Fibroblast growth factor (b- FGF) 


IBone morDhoaenetic protein receptor 


iG-orotein-coupled receptor 


Transcription factor SP2 
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Position 
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on 
to 


3519-3722 


>553-2830 


31-379 


527-805 


1229-1543 


718-976 


368-675 


452-791 


514-740 


622-1020 


565-945 


cn 
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o 

in 

CM 
00 


00 

CM 

cn 


951-1238 
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CM 
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|C1d 


5eneNarne i 


^dkS: cvclin-deoendent kinase 5 M 


'fiF-beta receotor tvoe 1 j E 


(inesin like orotein KIF 3B I* 


(inesin famitv Drotein K1F1A I* 


"ihmblast Growth factor 9 V 


Neuronal death orotein l 1 


3vd: SH-PTP2; adaptor protein tyrosine phosphatase II 


niftrferon reauiatorv factor 2 (IRF 2) I' 


amimin receptor 1 I 


MF-1 B orotein (transcription factor) I 


Jun-B: c-iun-related transcription factor I 


Tissue olasminoaen activator I 


Hnmfin Box Drotein 4.2 (Hox-4.2) I 


Nnr77 earlv resoonse orotein; thyroid hormone (TR3) receptor 1 


Fts-2 rranscriotion factor i 


n-Aun nroto-oncoaene (transcription factor AP-1 component) 1 


ferine protease inhibitor homolog J6 


Nerve arowth factor beta (beta-NGF) 


Cdk4: cvclin-dependent kinase 4 ! 


I Acetylcholine receptor delta submit 


IMAPKK1: MAP kinase kinase 3 (dual specificity) (MKK1) 


If* ABA- A transoorter4 


IfiABA-A transDorter3 


Vegfrl; Vascular endothelial growth factor receptor 1 / Fms-related tyrosine 
kinase 1 (Fit!) 


Adrenergic receptor, beta 1 


Eoh3 (Nuk) tvrosine-orotein kinase receptor 


ImTJI : DnaJ-like heat-shock protein from mouse tumor 


TlMP-3 tissue inhibitor of metailoproteinases-3 


I Insulin receDtor substrate-1 (IRS-1) 


iYV1 (UCRBP) transcriptional factor 


interleukin-convertinq enzyme (ICE) 


Hepatoma transmembrane kinase ligand 


IVoltaae-aated sodium channel 


Bad; heterodimeric partner for Bcl-XL and Bcl-2; promotes cell death 
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TA8LE2(CONT) 
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F7b 
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CO 
C3 


A7b 




CO 

< 
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C7h 


F6k 


|B3c 


5ene Name * 


ink stress-activated orotein kinase (SAPK) I* 


^skeletal eoidermal keratin (18 human) I f 


jprvA nrnwth factor aloha (alpha-NGF) 1* 


: nidermal keratin (1 human) I* 


Jimiinic acetylcholine receptor 1 ' 


l4HR1 : P-alvcoorotein: multidrug resistance protein; efflux pump |i 


",n? antinen . J 


*— 

>« 
o 
X 

c 

CD 

s 

a 

X 

■<§ 

o 

a; 

£ 
o 
X 


Ffttai mvnsin alkali liaht chain 1 


Intfirleukin 4 I 


Rb; pp105; Retinoblastoma susceptibility-associated protein (tumor 
snnnressor aene: cell cvcle regulator) 


p«k- rihosomai protein S6 kinase 


Platelet- derived orowth factor (A chain) {PDGF- A) | 


Cvtoskeletal eoidermal keratin (19 human) I 


: V(D)J recombination activating protein I 


o 
a 

i 

CO 

' c 
£ 


k-.fihrnhlafit orowth factor 1 


In^tamer hindina transcription factor (Oct 3) ! 


ptasmin^n^n activator inhibitor 


Inra antinan delta DolvDeotide 


1 Homftft Box orotein 2.5 (Hox-2.5) 


insPft4- heat shock 84kD orotein 


iMast r.fill nrotease fMMCP\ - 4 


Frk v *»vtrarfiliiilar siqnal-requlated kinase 1 : p44; Ert2 


PI3-K p85; phosphatidylinositot 3-kinase regulatory subunit; 
nhnRnhonrotein o85: PDGF signaling pathway member 


p58/GTA; galactosyltransferase associated protein kinase (cdc2-related 
prntein kinase) 


IftHrina nrotease inhibitor 2 (spi-2) 
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Table 2 (cont) 



Position 
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182-541 


2142-2423 
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JO 
CO 
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|C1e 


to 
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B6c 


c 
<o 
Q 


D4k 


CM 

o 


A7c 


|B3n 


aene Name | J 


unha rardiac myosin heavy chain 1 


^etinoic acid receptor RXR- gamma U 


anuiocvte-macroDhaae colony-stimulating factor receptor 1 1 


} ABA-A receDtor aloha-1 submit |l 


"ndothelial liaand for L-selectin (GLYCAM 1 ) II 


ntearin beta 7 subunit | 


3NAse 1 


^ortactin: orotein tyrosine kinase substrate 1 


Adenosine A2M2 receptor I 


DNA iiaase I I 


Adenosine A1 M receptor I 


Non-muscle myosin light chain 3 I 


Catheosin H 


Stati : sianal transducer and activator of transcription 


o2 1 /Cio1 /Waf 1 : cdk-inhibitor protein 1 I 


Cdk7; M015; cyclin-dependent kinase 7 (homologue of Xenopus M015 
Icdk-activatina kinase) 


o27kio1 : G1 cvdin-Cdk protein kinase inhibitor, p21-reiated i 


Gem: induced, immediate early protein; Ras family member 


VHL: Von Hippei-Lindau tumor suppressor protein 


I Cek 5 receptor protein tyrosine kinase ligand 


(Glutathione peroxidase {plasma protein): selenoprotein. 
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I Abiphitin-1 (abi-1) similar to HOXD3 


lBAG-1: bcl-2 binding protein with anti-ceil death activity 


She transforming adaptor protein; Src homology 2 (SH2) protein, SHB- 
related 


MAPKK4; MAP kinase kinase 4; Jnk activating kinase 1; (JNKK1; SEKI; 
MKK4) 


(Transcription factor LRG - 21 


(interferon inducible protein 1 


!A20 zinc finaer protein; apoptosis inhibitor 


Tp18ink4; cdk4 and cdk6 inhibitor 


l-kB (l-kappa B) beta 


iQenBanki l< 


M76601 / 


M84819 F 


M8507B < 


I M 86 5 66 ( 
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Table 2 (cont) 
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CO 
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|D6k 


iftne Name k 


\uu>- riishevelled-2 tissue oolaritv protein I* 


JnrlAar factor related to P45 NF-E2 H 


j!SH2 DNA mismatch repair protein; MutS homologue 2 < 


^nill- GTPase-activatina orotein |i 


tvrnsine-Drotein kinase (activated p21 cdc42Hs kinase {ack)) 1 1 


>107; RBL1; Retinoblastoma gene product-related protein p107 (celt cycle 


li i9ft7?4 IPMS2 DNA mismatch repair protein; yeast PMS1 homolog 2 I 


2? 

E 

«a 

cr 
u. 
2: 

0 

"S. 

0) 

o 

c 

0 
0 

c 

CO 

I s - 
01 

l C\J 

r: 


li i^iAOR IBRCA1: Breast/ovarian cancer susceptibility locus 1 product I 


Pmt* Murine homoloaue of the leukemia-associated PML gene I 


Transducin beta-2 subunit 


i-kB (l-kaoDa B) alpha chain 


TRAIL: TNF-related apoptosis inducing ligand; Apo-2 ligand I 


p130; Retinoblastoma gene product-related protein Rb2/p130 (cell cycle 


HACCC Box- bindina orotein BKLF 


FAF1 : Fas-associated protein factor, apoptosis activator 


7inc finoer transcription factor RU49 . 


rvrRP- G/T-mismatch bindina protein; MSH6 


I PI n heta: ohosDhoiioase C beta 3 


Frizzled-3; Drosophila tissue polarity gene frizzled homologue 3; 
dishevelled receptor 


1MAPKK3: MAP kinase kinase 3 (dual specificity) (MKK3, MEK3) 


Mvelobiastin: trvosin-chvmotrvpsin related serine protease 


IZinc finaer Kruppel type Zfp 92 


ItDAG51; couples TCR signaling to Fas (CD95) expression 


I POU domain, class 2. associated factor 1 


I Cas: Crk-associated substrate; focal adhesion kinase substrate 


I ALG-2: calcium binding protein required for programmed cell death 


I Unconventional mvosin VI 


Transcription factor CTCF (1 1 zinc fingers ) 


Transcription factor C 1 
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Table2(cont) 
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■*— 

CO 
CO 
CO 


584-882 


379-667 


482-734 


00 
CO 
IO 

ts 

CM 


41-408 


2342-2598 


439-666 


CM 
CM 

r» 

cn 


1949-2284 


920-1278 


674-978 


to 

I s - 
co 

6 
o 
rr 


|587-B94 


868-1096 


643-1017 


CM 

IO 

CO 

*r 


670-971 


866-1128 


826-1132 


o 

CM 
00 
CM 

CO 

-<r 

rf 

CM 


425-651 


00 

cn 
cn 

cn 

03 
CO 


11898-2182 I 


904-1150 


j 

CO 
1 

CO 
CO 
CM 


[1702-2040 | 


-\rray Coordinate 1 






F3h ! 


a> 


<d 

LO 

Q 


O 


B5a 


S 


A2h 


CO* 

u. 


c 

CO 
LU 


A4I 


£ 


D4d 


D4b 


A3c 


F7t 


A2g I 


JZ 

<o 
u. 


|F6d 


E 

CO 

a 


F3g 


|F1e 


K 
a 


Q 


_Q 
CM 
LU 


B7c 


xz 
I s - 
a 


c 
to 

LL 


r- 

a 


E 

I s - 

LL 


|D3a 


Sene Name i 


rfadrl; mSmadl; Mothers against dpp protein (Mad) murine homologue; 
fGF-beta sianaiino protein- 1 (bsp-1); candidate tumor suppressor gene i 


3ci-W aooDtosis reaulator; Bcl-2 family member H 


dad related protein 2 (MADR2) 1 


2vctinC(G1 -specific) [ 


^Dh-1 nuclear transcriptional repressor for hox genes I 


RadBO: DNA reoair protein I 1 


Fvn oroto-oncoaene; Src family member 


c-mvc oroto-oncoqene protein 


c-Fos proto-oncogene; transcription factor AP-1 component, fos cellular 

nncooene 


Cathepsin L I 


Glutamate receptor channet subunit gamma 


c-Fes oroto-oncoaene I 


Cvtotoxic cell protease 2 (B10) I 


Homeo Box Drotein 3.1 (Hox-3.1) I 


Homeo Box Drotein 2.4 (Hox-2.4) I 
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r^-FrhA oncoaene: thvroid hormone receptor. I 
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IVimentin 


I HMG-14 non histone chromosomal protein 


IMacroohaae inflamatory protein 2 alpha (MiP 2 alpha) 


I Bone morohoaenetic protein 7 (BMP-7) (osteogenic protein 1) 


(Transcription factor SP1P (POUdomain transcription factor ) 


I Homeo Box protein 8 (Hox-8) 


I Fibroblast Qiowth factor receptor 4 


IRad murine homologue 


iTranscriotion factor UBF 


l Kinesin heavv chain 


CCAAT- Bindinq transcription factor {C/ EBP) 


lTlMP-2 tissue inhibitor of metalIoproteinases-2 


Ets-related protein PEA 3 
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Table 2 (cont) 



Position | 


1083-1351 | 


1081-1325 | 


874-1236 I 


17-332 


148M734 | 


1064-1304 j 


1394-1721 


746-985 | 


552-916 | 


4137-4375 | 


925-1305 | 


982-1314 | 


599-954 | 


447-669 I 


1595-1976 | 


939-1329 | 


781-1140 j 


27-256 | 


449-817 


461-824 | 


[701-1039 I 


11017-1334 I 


1877-1101 j 


1498-1680 


1426-728 1 


1 1000-1 306 I 


11095-1323 1 


1755-1054 I 


807-1197 I 
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14-298 
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Vav; GDP-GTP exchange factor; proto-oncogene | 


PAX-6 (paired box protein) I 


'o 

CD 
Q. 
i/i 
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S 

CM 
CQ 
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*o 
>% 
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ChopIO; murine homologue of Gadd153 (growth arrest and DNA-damage- 
inducible proein) 


PD-1 possible ceil death inducer; Ig gene superfamiiy member 


Inhibin beta A subunit (TGF beta family) | 


Vegfr2; KDFt/flk1 vascular endothelial growth factor tyrosine kinase 

receptor 


Protease nexin 1 (PN-1) | 


MRE-binding transcription factor | 


Activator -1 140 KD subunit {replication factor C 140KD) 


DP-1 {DRTF-polipeptide 1) cell cycle regulatory transcription factor | 


O 
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O 

Q. 

<U 

u 

s> 

'c 
o 

2 

0) 

(/) 

s 

G 
& 

o 
■o 

X 
in 


Getatinase B I 


XPAC; xeroderma pigmentosum group A correcting protein | 


tntegrin alpha 2 (CD49b) | 


Growth/ diffferentiation factor 2 (GDF-2) | 


Insulin-like growth factor binding protein-4 (IGFBP-4) 


Insulin-like growth factor binding protein-1 (IGFBP-t) I 


IGFBP-2; insutin-like growth factor binding protein 2; autocrine and/or 
paracrine growth promoter 


Insulin-like growth factor binding protein-5 (IGFBP-5) 


Insulin-like growth factor binding protein -6 (IGFBP 6) 


A-myb proto-oncogene; myb-related protein A 


[Membrane type matrix matalioproteinase 


Eik-1 ets-related proto-oncogene 


(E2F-5 transcription factor 


Lbx 1 transcription factor 


|P-selectin (glycoprotein iigand-1) 


| Transcription factor SEF2 


|Macrophage mannose receptor 


|Rab-2 ras-related protein 


Gluthathione S-transferase (theta typel); phase 11 conjugation enzyme 
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Cancer Array 

In the cancer arrays of the subject invention, the polynucleotide probe compositions 
on the array correspond to those genes which are associated, e.g. play a role in, cellular 
proliferative diseases, particularly cancer, where human genes are of particular interest in 
many embodiments. Types of genes that are typically represented on a cancer array of the 
subject invention include: oncogenes, tumor suppressors, cell cycle regulators, genome 
plasticity genes, apoptosis genes, cell differentiation genes, regulators of tumor host 
interaction and metastasis, such as extracellular matrix proteins, cell adhesion receptors, 
molecules that control cell invasion and motility, and genes associated with angiogenesis. 

In certain embodiments, of particular interest is an array having the following types 
of genes represented on its surface: cell cycle/growth regulators: apoptosis; growth 
factors/cytokines; oncogenes/tumor suppressors; cell adhesion, motility and invasion; 
invasion regulators; OTP ases and their regulators; cadherins; intermediate filament markers; 
receptors; cell fate/development regulators; DNA damage/response/repair/ recombination; 
and angiogenesis regulators. In a specific cancer array of interest, the spots are as listed in 
Table 3. 

The cancer array finds use in a variety of applications, including: monitoring cellular 
responses to therapeutic compounds; comparing expression profiles of tumors at different 
developmental stages; developing diagnostic tools for distinguishing closely related tumors; 
and the like. 

In the following Table 3, as well as preceding Tables 1 and 2, the "position- 
coordinate refers to the actual nucleotide residues of the listed gene that are represented on 
the array. 
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BASIC FIBROBLAST GROWTH FACTOR RECEPTOR 1 PRECURSOR I 
(BFGF-R) (EC 27.1.112) (FMS-LIKE TYROSINE KINASE-2) (C-FGR) I 
(FGFR1) (FLG) (FGFBR) (FLT2). (HBGF-R-ALPHA-A1) (HBGF-R-ALPHA- 
A2) (HBGF-R-ALPHA-A3) + FGFR SECRETED FORM (M34188) 


MFRVF GROWTH FACTOR RECEPTOR I 
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transmembrane receptor precursor (PTK7); COLON CARCINOMA 
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HIGH AFFINITY NERVE GROWTH FACTOR RECEPTOR PRECURSOR 
{EC 2.7.1.112) (TRK1 TRANSFORMING TYROSINE KINASE PROTEIN) 
(P140-TRKA) + trk-T3 (P68 TRK-T3 ONCOPROTEIN) 
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Iti imor MFHROSIS FACTOR RECEPTOR 1 


TUMOR NECROSIS FACTOR RECEPTOR 2 PRECURSOR (TUMOR 
NECROSIS FACTOR BINDING PROTEIN 2) (TBPIl) (P80) (TNF-R2) 
rP7Kl {Cm?C\B\ /TNFR21 fTNFBR), 


RETINOIC ACID RECEPTOR ALPHA1 (RAR- ALPHA1) + PML-RAR 

nrotein 


retinoic acid receptor alpha [RETINOIC ACiD RECEPTOR RXR-ALPHA 


retinoic acid receptor epsilon (RETINOIC ACID RECEPTOR BETA-2 (RAF 

RFTA-?\ fRAR-EPSILONH 


retinoic acid receptor gamma [RETINOIC ACiD RECEPTOR GAMMA] 


retinoic acid receptor Dtr-beta [RETINOIC ACID RECEPTOR RXR-BETA] 
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VASCULAR ENDOTHELIAL GROWTH FACTOR RECEPTOR 2 
PRECURSOR (EC 27.1.112) (VEGFR-2) (KDR) (KINASE INSERT 
noMAlN RECEPTOR* (FRAGMENT) 


VASCULAR ENDOTHELIAL GROWTH FACTOR RECEPTOR 3 
PRECURSOR (EC 2.7.1.112) (VEGFR-3) (TYROSINE-PROTEIN KINASE 
RECEPTOR FLT4. CLASS 111). 


FL CYTOKINE RECEPTOR PRECURSOR (EC 2,7.1.1 12) (TYROSINE- 
PROTEIN KINASE RECEPTOR FLT3) (STEM CELL TYROSINE KINASE 
1WSTK-1WCD13S ANTIGEN). 


TYROSINE-PROTEIN KINASE RECEPTOR TIE-1 PRECURSOR (EC 
2.7.1.1121. 


TYROSINE-PROTEIN KINASE RECEPTOR TIE-2 PRECURSOR (EC 
2.7.1 .112) (TYROSINE-PROTEIN KINASE RECEPTOR TEK) (P140 TEK) 
(TUNICA INTERNA ENDOTHELIAL CELL KINASE). 
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PLASMA SERINE PROTEASE INHIBITOR PRECURSOR (PCI) 
(PROTEIN C INHIBITOR) (PLASMINOGEN ACTIVATOR INHIBITOR-3) 


UROKINASE PLASMINOGEN ACTIVATOR SURFACE RECEPTOR, GPI- 1 
ANCHORED FORM PRECURSOR (U-PAR) (MONOCYTE ACTIVATION 
amtigcm un« /r.nR7 ANTIGEN* ! 


LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 1 
PRECURSOR (LRP) (ALPHA-2-MACROGLOBULIN RECEPTOR) (A2MR) 


LOW-DENSITY LIPOPROTEIN RECEPTOR-RELATED PROTEIN 2 
fUEGALIN) (GLYCOPROTEIN 330) (FRAGMENTS 


ALPHA-2-MACRQGLOBULlN rutounoun {tv-rrw^m) 
PLATELET BASIC PROTEIN PRECURSOR (PBP) (CONTAINS: 
CONNECTIVE-TISSUE ACTIVATING PEPTIDE til {CTAP-III), LOW- 
AFFINITY PLATELET FACTOR IV (LA-PF4), BETA- 
THROMBOGLOBULIN (BETA-TG), NEUTROPHIL-ACTIVATING 


ALPHA-2-MACR0GL0BULIN RECEPTOR-ASSOCIATED PROTEIN 
PRECURSOR (ALPHA-2-MRAP) (LOW DENSITY LIPOPROTEIN 
cc^cD-rnQ.npr ATPD PROTEIN- ASSOCIATED PROTEIN 1) (RAP) 


NUCLEOSIDE DIPHOSPHATE KINASE A (EC 2.7.4.6) (NDK A) (NDP 
KINASE A) (TUMOR METASTATIC PROCESS-ASSOCIATED PROTEIN) 
(METASTASIS INHIBITION FACTOR NM23) (NM23-H1). 
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FACTOR 8) (HBGF-8) {OSTEOBLAST SPECIFIC FACTOR 1) (OSF-1) 
(HEPARIN-BINDING NEURITS OUTGROWTH PROMOTING FACTOR 1) 
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HEPATOCYTE GROWTH FACTOR-LIKE (macrophage-stimulating I 
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(fiR (HFPATOPOEiTIN A). 


HfiF AG0NIST/ANTAGU1N3 1 

COMPETITIVE HEPATOCYTE GROWTH FACTOR ANTAGONIST. AN 
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Apoptosis Array 

In the apoptosis array according to the subject invention, all of the unique 
polynucleotide probe compositions correspond to genes that are associated with apoptosis, 
e.g. cell cycle genes. In a specific apoptosis array of interest, the spots are as provided in 
Table 4. 
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Human Stress Array 

In the human stress array according to the subject invention, ail of the unique 
polynucleotide probe compositions correspond to genes that are associated with stres 
responses of human ceils, e.g. stress response regulators and effectors. In a specific h 
stress array of interest, the spots are as provided in Table 5. 
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Oncogene and Tumor Suppressor Gene Array 

In the oncogene and tumor suppressor gene array according to the subject invention, 
all of the unique polynucleotide probe compositions correspond to genes that are associated 
with cellular proliferative diseases, specifically neoplastic diseases. Genes of interest that 
may be represented on the array include: oncogenes and tumor suppressor genes. In a 
specific oncogene and tumor suppressor gene array of interest, the spots are as provided in 
Table 6. 
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Table 6 (Cont) 



3ene Name _ 


\M (rac protein kinase alpha, protein kinase B t c-Akt) 
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Cell-Cell Interaction Array 

In the cell-ceil interaction array according to the subject invention, all of the unique 
polynucleotide probe compositions correspond to genes that are associated with cell-cell 
interaction, e.g. cell-cell signaling. In a specific cell-cell interaction array of interest, the 
spots are as provided in Table 7. 
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SELL INTERACTION (Gene Names) 1 
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Table7(Cont) 



:ELL INTERACTION (Gene Names) 1 
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Table 7 (Cont) 



DELL INTERACTION (Gene Names) 1 
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DELL INTERACTION (Gene Names) I 


DADHER1N-13 T-CADHERIN PRECURSOR (TRUNCATED-CADHERiN) (H-CADHERIN) 
[HEART-CADHERIN) 
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ANTIGEN) (CD144 ANTIGEN). 


lALPHA-2-MACROGLOBULIN PRECURSOR (ALPHA-2-M) | 
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Table 7 (Cont) 



CELL INTERACTION (Gene Names) 1 


EPHR1N TYPE-A RECEPTOR 2 PRECURSOR (EC 2.7.1 .1 12) (TYROSINE -PROTEIN 
KINASE RECEPTOR ECK) (EPITHELIAL CELL KINASE). 
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Cytokine and Cytokine Receptor Array 

In the cytokine and cytokine receptor array according to the subject invention, ail of 
the unique polynucleotide probe compositions correspond to genes that express cytokines or 
cytokine receptors. In a specific cytokine and cytokine receptor array of interest, the spots 
are as provided in Table 8. 
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Cell Cycle Array 

In the cell cycle array according to the subject invention, all of the unique 
polynucleotide probe compositions correspond to genes that are associated with the life cycle 
of a cell. In a specific cell cycle array of interest, the spots are as provided in Table 9. 
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Other Representative Arrays 

In a neuroarray according to the subject invention, ail of the unique polynucleotide 
probe compositions will correspond to genes that are expressed in brain related tissues. 
Genes that are represented on the array are key genes, by which is meant that they have been 
5 reported to play primary roles in a variety of different biological processes in brain tissues. 
Genes of interest that may be represented on the array include: ion channel/transport 
proteins; receptors; cell cycle regulators; stress response proteins; apoptosis proteins; signal 
transduction proteins; transcriptional factors; growth factors/interleukins/hormones; 
oncogenes and tumor suppressors; cell surface/adhesion proteins; DNA 
10 synthesis/repair/recombination genes; and metabolic pathway enzymes. 

In certain embodiments, of particular interest is an array having the following types 
of genes represented on its surface: nuclear proteins; endoplasmic reticulum proteins; golgi 
complex proteins; endosomal proteins; lysosomal proteins; peroxisomal proteins; 
mitochondrial proteins; cytoplasmic proteins; cytoskeletal proteins; plasma membrane 
1 5 proteins; post synaptic and dendritic proteins; axonal and nerve terminal proteins; secreted 
proteins, neuropeptides, hormones and growth factors; extracellular matrix proteins; 
astrocyte and oligodendroglial proteins; immune system proteins; developmentally regulated 
proteins; regionally regulated proteins; and disease related proteins. 

Other representative arrays include: (1) rat arrays, in which each of the unique 
20 polynucleotide corresponds to a key rat gene; (2) blood arrays, in which each unique 
polynucleotide corresponds to a gene associated cells and tissues associated with the 
cardiovascular system; (3) rat stress arrays; and (4) mouse stress arrays, in which each 
unique polynucleotide corresponds to a gene associated with the stress response of murine 
cells. 



25 



Methods of Using the Subject Arrays 

The subject arrays find use in a variety of different applications in which one is 
interested in detecting the occurrence of one or more binding events between target nucleic 
acids and probes on the array and then relating the occurrence of the binding event(s) to the 
presence of a target(s) in a sample. In general, the device will be contacted with the sample 
suspected of containing the target under conditions sufficient for binding of any target 
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present in the sample to a complementary polynucleotide present on the array. Generally, the 
sample will be a fluid sample and contact will be achieved by introduction of an appropriate 
volume of the fluid sample onto the array surface, where introduction can be pipette, 
deposition, and the like. 

5 

Generation of Labeled Target 

Targets may be generated by methods known in the art. mRNA can be labeled and 
used directly as a target, or converted to a labeled cDNA target. Generally, such methods 
include the use of oligonucleotide primers. Primers that may be employed include oligo dT, 

1 0 random primers, e.g. random hexamers and gene specific primers. 

Of particular interest in the generation of labeled target is the use of a set of a 
representational number of gene specific primers, as described in U.S. Patent Application 
No. 08/ 859,998, the disclosure of which is herein incorporated by reference. As the subject 
sets comprise a representational number of primers, the total number of different primers in 

1 5 any given set will be only a fraction of the total number of different or distinct RNAs in the 
sample, where the total number of primers in the set will generally not exceed 80 %, usually 
will not exceed 50 % and more usually will not 20% of the total number of distinct RNAs, 
usually the total number of distinct messenger RNAs (mRNAs), in the sample. Any two 
given RNAs in a sample will be considered distinct or different if they comprise a stretch of 

20 at least 100 nucleotides in length in which the sequence similarity is less than 98%, as 

measured using the FASTA algorithm at default settings. As the sets of gene specific primers 
comprise only a representational number of primers, with physiological sources comprising 
from 5,000 to 50,000 distinct RNAs, the number of different gene specific primers in the set 
of gene specific primers will typically range from about 20 to 10,000, usually from 50 to 

25 2,000 and more usually from 75 to 1 500. 

Each of the gene specific primers of the sets described above will be of sufficient 
length to specifically hybridize to a distinct nucleic acid member of the sample, e.g. RNA or 
c DNA, where the length of the gene specific primers will usually be at least 8 nt, more 
usually at least 20 nt and may be as long as 25 nt or longer, but will usually not exceed 50 nt. 

30 The gene specific primers will be sufficiently specific to hybridize to complementary 

template sequence during the generation of labeled nucleic acids under conditions sufficient 
for first strand cDNA synthesis, which conditions arc known by those of skill in the art. The 
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number of mismatches between the gene specific primer sequences and their complementary 
template sequences to which they hybridize during the generation oflabeled nucleic acids in 
the subject methods will generally not exceed 20 number %, usually will not exceed 10 
number % and more usually will not exceed 5 number %. 

5 Generally, the sets of gene specific primers will comprise primers that correspond to 

at least 20, usually at least 50 and more usually at least 75 distinct genes as represented by 
distinct mRNAs in the sample, where the term "distinct" when used to describe genes is as 
defined above, where any two genes are considered distinct if they comprise a stretch of at 
least 100 nt in their RNA coding regions in which the sequence similarity does not exceed 

1 0 98%, as determined using the F ASTA algorithm at default settings. 

The gene specific oligonucleotide primers may be synthesized by conventional 
oligonucleotide chemistry methods, where the nucleotide units may be: (a) solely nucleotides 
comprising the heterocyclic nitrogenous bases found in naturally occurring DNA and RNA, 
e.g. adenine, cytosine, guanine, thymine and uracil; (b) solely nucleotide analogs which are 

1 5 capable of base pairing under hybridization conditions in the course of DNA synthesis such 
that they function as the above nucleotides found in naturally occurring DNA and RNA, 
where illustrative nucleotide analogs include inosine, xanthine, hypoxanthine, 1,2- 
diaminopurine and the like; or (c) from combinations of the nucleotides of (a) and nucleotide 
analogs of (b), where with primers comprising a combination of nucleotides and analogues 

20 thereof, the number of nucleotide analogues in the primers will typically be less than 25 and 
more typically less than 5. The gene specific primers may comprise reporter or hapten 
groups, usually t to 2, which serve to improve hybridization properties and simplify 
detection procedure. 

Depending on the particular point at which the gene specific primers are employed in 
25 the generation of the labeled nucleic acids, e.g. during first strand cDNA synthesis or 
following one or more distinct amplification steps, each gene specific primer may 
correspond to a particular RNA by being complementary or similar, where similar usually 
means identical, to the particular RNA. For example, where the gene specific primers are 
employed in the synthesis of first strand cDNA, the gene specific primers will be 
30 complementary to regions of the RNAs to which they correspond. 

Each gene specific primer can be complementary to a sequence of nucleotides which 
is unique in the population of nucleic acids, e.g. mRNAs, with which the primers are 

-138- 



WO 98/53103 



PCT/US98/10561 



to 



contacted, or one or more of the gene specific primers in the set may be complementary to 
several nucleic acids in a given population, e.g. multiple mRNAs. such that the gene specific 
primer generates labeled nucleic acid when one or more of set of related nucleic acid species, 
e.g. species having a conserved region to which the primer corresponds, are present in the 
sample. Examples of such related nucleic acid species include those comprising: repetitive 
sequences, such as Alu repeats, Al repeats and the like; homologous sequences in related 
members of a gene-family; polyadenylation signals; splicing signals; or arbitrary but 

conversed sequences. 

Depending on the particular nature of the labeled nucleic acid generation step of the 
subject methods, the gene specific primers may be modified in a variety of ways. One way 
the gene specific primers may be modified is to include an anchor sequence of nucleotides, 
where the anchor is usually located 5' of the gene specific portion of the primer and ranges in 
length from 10 to 50 nt in length, usually 15 to 40 nt in length. The anchor sequence may 
comprise a sequence of bases which serves a variety of functions, such as a sequence of 
1 5 bases which correspond to the sequence found in promoters for bacteriophage RNA 

polymerase, e.g. T7 polymerase, T3 polymerase, SP6 polymerase, and the like; arbitrary 
sequences which can serve as subsequent primer binding sites; and the like. 

Turning now to the methods employing the above sets of gene specific primers, the 
first step in the subject methods is to obtain a sample of nucleic acids, usually RNAs, from a 
20 physiological source, usually a plurality of physiological sources, where the term plurality is 
used to refer to 2 or more distinct physiological sources. The physiological source of RNAs 
will typically be eukaryotic, with physiological sources of interest including sources derived 
single celled organisms such as yeast and multicellular organisms, including plants and 
animals, particularly mammals, where the physiological sources from multicellular 
25 organisms may be derived from particular organs or tissues of the multicellular organism, or 
from isolated cells derived therefrom. Thus, the physiological sources may be different cells 
from different organisms of the same species, e.g. cells derived from different humans, or 
cells derived from the same human (or identical twins) such that the cells share a common 
genome, where such ceils will usually be from different tissue types, including normal and 
30 diseased tissue types, e.g. neoplastic, cell types. In obtaining the sample of RNAs to be 
analyzed from the physiological source from which it is derived, the physiological source 
may be subjected to a number of different processing steps, where such processing steps 
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might include tissue homogenation, nucleic acid extraction and the like, where such 
processing steps are known to the those of skill in the art. Methods of isolating RNA from 
cells, tissues, organs or whole organisms are known to those of skill in the art and are 
described in Maniatis et a/., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor 
5 Press)(1989). 

The next step in the subject methods is the generation of labeled nucleic acids 
representative of the nucleic acid, usually RNA, profile of the physiological source. As 
mentioned above, a set of gene specific primers is used to generate the labeled nucleic acids 
from the sample of RNAs, where the labeled nucleic acids generated in this step may serve 
10 as "target" in subsequent assays in which the differences in the RNA profiles of at least two 
sources are analyzed. As used herein, the term "target" refers to single stranded RNA, single 
stranded DNA and double stranded DNA, where the target is generally greater than 50 nt in 
length. 

The set of primers may be used either in first strand cDNA synthesis or following 

15 one or more amplification steps. Furthermore, the actual synthesis of the labeled nucleic 
acids may be at the same step during which the sets of gene specific primers are employed, 
or the synthesis of the labeled nucleic acids may be one more steps subsequent to the step in 
which the sets of gene specific primers are employed. 

In a first embodiment of the invention, the set of gene specific primers is used to 

20 generate labeled first strand cDNA, where the labeled first strand cDNA is representative of 
the RNA profile of the physiological source being assayed. The labeled first strand cDNA is 
prepared by contacting the RNA sample with the primer set and requisite reagents under 
conditions sufficient for reverse transcription of the RNA template in the sample. Requisite 
reagents contacted with the primers and RNAs are known to those of skill in the art and will 

25 generally include at least an enzyme having reverse transcriptase activity and dNTPs in an 
appropriate buffer medium. 

A variety of enzymes, usually DNA polymerases, possessing reverse transcriptase 
activity can be used for the first strand cDNA synthesis step. Examples of suitable DNA 
polymerases include the DNA polymerases derived from organisms selected from the group 

30 consisting of a thermophilic bacteria and archaebacteria, retroviruses, yeasts, Neurosporas, 
Drosophilas, primates and rodents. Preferably, the DNA polymerase will be selected from 
the group consisting of Moloney murine leukemia virus (M-MLV) as described in United 
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States Patent No. 4,943,531 and M-MLV reverse transciptase lacking RNaseH activity as 
described in United States Patent No. 5,405,776 (the disclosures of which patents are herein 
incorporated by reference), human T-cell leukemia virus type I ( HTLV-I ), bovine leukemia 
virus ( BLV ), Rous sarcoma virus (RSV ), human immunodeficiency virus ( HIV ) and 
Thermus aquaticus ( Taq ) or Thermus thermophilus (Tth) as described in United States 
Patent No. 5,322,770, the disclosure of which is herein incorporated by reference. Suitable 
DNA polymerases possessing reverse transcriptase activity may be isolated from an 
organism, obtained commercially or obtained from cells which express high levels of cloned 
genes encoding the polymerases by methods known to those of skill in the art, where the 
particular manner of obtaining the polymerase will be chosen based primarily on factors such 
as convenience, cost, availability and the like. 

The various dNTPs and buffer medium necessary for first strand cDNA synthesis 
through reverse transcription of the primed RNAs may be purchased commercially from 
various sources, where such sources include Clontech, Sigma, Life Technologies, 
15 Amersham, Boehringer-Mannheim. Buffer mediums suitable for first strand synthesis will 
usually comprise buffering agents, usually in a concentration ranging from 10 to 100 uM 
which typically support a pH in the range 6 to 9, such as Tris-HCl, HEPES-K.OH, etc.; salts 
containing monovalent ions, such as KC1, NaCl, etc.", at concentrations ranging from 0-200 
mM; salts containing divalent cations like MgCl,, Mg(OAc) etc, at concentrations usually 
20 ranging from 1 to 10 mM; and additional reagents such as reducing agents, e.g. DDT, 

detergents, albumin and the like. The conditions of the reagent mixture will be selected to 
promote efficient first strand synthesis. Typically the set of primers will first be combined 
with the RNA sample at an elevated temperature, usually ranging from 50 to 95 °C, 
followed by a reduction in temperature to a range between about 0 to 60 °C, to ensure 
25 specific annealing of the primers to their corresponding RNAs in the sample. Following this 
annealing step, the primed RNAs are then combined with dNTPs and reverse transcriptase 
under conditions sufficient to promote reverse transcription and first strand cDN A synthesis 
of the primed RNAs. By using appropriate types of reagents, all of the reagents can be 
combined at once if the activity of the polymerase can be postponed or timed to start after 
30 annealing of the primer to the RNA. 

In this embodiment, one of either the gene specific primers or dNTPs. preferably the 
dNTPs, will be labeled such that the synthesized cDNAs are labeled. By labeled is meant 
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that the entities comprise a member of a signal producing system and are thus detectable, 
either directly or through combined action with one or more additional members of a signal 
producing system. Examples of directly detectable labels include isotopic and fluorescent 
moieties incorporated into, usually covalently bonded to, a nucleotide monomeric unit, e.g. 

5 dNTP or monomeric unit of the primer. Isotopic moieties or labels of interest include 32 P, 
J3 P, J5 S, l25 I, and the like. Fluorescent moieties or labels of interest include coumarin and its 
derivatives, e.g. 7-amino-4-methylcoumarin, aminocoumarin, bodipy dyes, such as Bodipy 
FL, cascade blue, fluorescein and its derivatives, e.g. fluorescein isothiocyanate, Oregon 
green, rhodamine dyes, e.g. texas red, tetramethylrhodamine, eosins and erythrosins, cyanine 

10 dyes, e.g. Cy3 and Cy5, macrocyclic chelates of '"nthanide ions, e.g. quantum dye™, 

fluorescent energy transfer dyes, such as thiazole orange-ethidium heterodimer, TOTAB, etc. 
Labels may also be members of a signal producing system that act in concert with one or 
more actional members of the same system to provide a detectable signal. Illustrative of 
such i 'bel? are members of a specific binding pair, such as ligands, e.g. biotin, fluorescein, 

1 5 digoxigenin, antigen, polyvalent cations, chelator groups and the like, where the members 
specifically bind to additional members of the signal producing system, where the additional 
members provide a detectable signal either directly or indirectly, e.g. antibody conjugated to 
a fluorescent moiety or an enzymatic moiety capable of converting a substrate to a 
chromogenic product, e.g. alkaline phosphatase conjugate antibody, and the like. 

20 In one preferred embodiment, the member of the signal producing system bound to 

the nucleotide is functional group capable of covalently binding to additional members of the 
signal producing system to generate a detectable label. Examples of such functional groups 
or moieties include amino, sulfhydryl, azido, isothiocyanate, sulfoxyl, and the like. The 
labeled target generated using such nucleotides will thus include one or more, usually a 

25 plurality of, functional moieties. For detection, the functional moieties of the modified 
nucleotides can be labeled by conjugation of a label to the functional moiety. A variety of 
suitable labels and methods for their conjugation to functional moieties are known to those 
of skill in the art. Examples include labeling of amino-modified cDNA by a succinimidyl 
ester of an appropriate dye, e.g. Alexa. Bodipy, orCy dyes. Alternatively, label can be 

30 entrapped or bonded into structures of microscopic-sized particles. These particles can then 
be conjugated with the functional moieties of the target. 
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For each sample of RNA, one can generate labeled oligos with the same labels. 
Alternatively, one can use different labels for each physiological source, which provides for 
additional assay configuration possibilities, as described in greater detail below. 

In a variation of the above embodiment, where desired one can generate labeled RNA 
5 instead of labeled first strand cDNA. In this embodiment, first strand cDNA synthesis is 
carried out in the presence of unlabeled dNTPs and unlabeled gene specific primers. 
However, the primers are optionally modified to comprise a promotor for an RNA 
polymerase, such as T7 RNA polymerase, T3 RNA polymerase. SP6 RNA polymerase, and 
the like. In this embodiment, following first strand cDNA synthesis, the resultant single 

1 0 stranded cDNA is then converted to double stranded cDN A, where the resultant double 

stranded cDNA comprises the anchor sequence comprising the promoter region. Conversion 
of the mRNA:cDNA hybrid following first strand synthesis can be carried out as described 
in Okayama & Berg, Mol. Cell. Biol. (1982) 2:161-170, and Gubler & Hoffman, Gene 
(1983) 25: 253-269, where briefly the RNA is digested with a ribonuclease, such as E.coli 

1 5 RNase H, followed by repair synthesis using a DNA polymerase like DNA polymerase I, 
etc., and E.coli DNA ligase. One may also employ the modification of this basic method 
described in Wu, R, ed M Methods in Enzymology (1987), vol. 153 (Academic Press). Next, 
the double stranded cDNA is contacted with RNA polymerase and dNTPs, including labeled 
dNTPs as described above, to produce linearly amplified labeled ribonucleic acids. For 

20 cDNA lacking the anchor sequence comprising a promoter region, a polymerase that does 
not need a promoter region but instead can initiate RNA strand synthesis randomly from 
cDNA, such as core fragment of E.Coli RNA polymerase, may be employed. 

In another embodiment of the subject invention, the labeled nucleic acid generation 
step comprises one or more enzymatic amplification steps in which multiple DNA copies of 

25 the initial RNAs present in the sample are produced, from which multiple copies of the 
initial RNA or multiple copies of antisense RNA (aRNA) may be produced, using the 
polymerase chain reaction, as described in U,S. Pat. No. 4,683,195, the disclosure of which 
is herein incorporated by reference, in which repeated cycles of double stranded DNA 
denaturation, oligonucleotide primer annealing and DNA polymerase primer extension are 

30 performed, where the PCR conditions may be modified as described in U.S. Pat No. 
5,436,149, the disclosure of which is herein incorporated by reference. 
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In one embodiment involving enzymatic amplification, the set of gene-specific 
primers are employed in the generation of the first strand cDNA, followed by amplification 
of the first strand cDNA to produce amplified numbers of labeled cDNA. In this 
embodiment, as a set of gene-specific primers is employed in the first strand synthesis step, 

5 only a representative proportion of the total RNA in the sample is amplified during the 
subsequent amplification steps. 

Amplification of the first strand cDNA can be conveniently achieved by using a 
CAPswitch™ oligonucleotide as described in U.S. Patent Application Serial No. 08/582,562, 
the disclosure of which is herein incorporated by reference. Briefly, the CAPswitch™ 

10 technology uses a unique CAPswitch™ oligonucleotide in the first strand cDNA synthesis 
followed by PCR amplification in the second step to generate a high yield of ds cDNA. 
When included in the first-strand cDNA synthesis reaction mixture, the CAPswitch™ 
oligonucleotide serves as a short extended template. When reverse transcriptase stops at the 
5 r end of the mRNA template in the course of first strand cDNA synthesis it switches 

1 5 templates and continues DNA synthesis to the end of the CAPswitch™ oligonucleotide. The 
resulting ss cDNA incorporates at the 3' end, sequence which is complimentary to complete 
5* end of the mRNA and the CAPswitch™ oligonucleotide sequence. 

Of particular interest as the CAPswitch™ oligonucleotide are oligonucleotides 
having the following formula: 

20 

5 , -dNi-dN2-...dNm-rNl-rN2...rNn-3 , 

wherein: 

dN represents a deoxyribonucleotide selected from among dAMP, dCMP, dGMP and 

25 dTMP; 

m represents an integer 0 and above, preferably from 10 to 50; 
rN represents a ribonucleotide selected from the group consisting of AMP, CMP, 
GMP and UMP, preferably GMP; and 

n represents an integer 0 and above, preferably from 3 to 7. 

30 

The structure of the CAPswitch™ oligonucleotide may be modified in a number of 
ways, such as by replacement of 1 to 10 nucleotides with nucleotide analogs, incorporation 
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of terminator nucleotides, such as 3*-amino NMP. 3'-phosphate NMP and the like, or 
non-natural nucleotides which can improve efficiency of the template switching reaction but 
still retain the main function* of the CAPswitch™ oligonucleotide Le. CAP-depended 
extension of full-length cDNA by reverse transcriptase using CAPswitch™ oligonucleotide 
5 as a template. 

In using the CAPswitch™ oligonucleotide, first strand cDNA synthesis is carried out 
in the presence of a set of gene specific primers and a CAPswitch™ oligonucleotide, where 
the gene specific primers have been modified to comprise an arbitrary anchor sequence at 
their 5' ends. The first strand cDNA is then combined with primer sequences complementary 

1 0 to: (a) all or a portion of the CAPswitch™ oligonucleotide and (b) the arbitrary anchor 
sequence of the gene specific primers and additional PCR reagents, such as dNTPs, DNA 
polymerase, and the like, under conditions sufficient to amplify the first strand cDNA. 
Conveniently, PCR is carried out in the presence of labeled dNTPs such that the resultant, 
amplified cDNA is labeled and serves as the labeled or target nucleic acid. Labeled nucleic 

1 5 acid can also be produced by carrying out PCR in the presence of labeled primers, where 

either or both the CAPswitch™ oligonucleotide complementary primer and anchor sequence 
complementary primer may be labeled. In yet an alternative embodiment, instead of 
producing labeled amplified cDNA, one may generate labeled RNA from the amplified ds 
cDNA, e.g. by using an RNA polymerase such as E.coli RNA polymerase, or other RNA 

20 polymerases requiring promoter sequences, where such sequences may be incorporated into 
the arbitrary anchor sequence. 

Instead of using the set of gene specific primers in the first strand cDNA synthesis 
step followed by subsequent amplification of only a representative fraction of the total 
number of distinct RNA species in the sample, one may also amplify all of the RNAs in the 

25 sample and use the set of gene specific primers to generate labeled nucleic acid following 
amplification. This embodiment may find use in situations where the RNA of interest to be 
amplified is known or postulated to be in small amounts in the sample. 

In this embodiment, first strand synthesis is carried out using: (a) an oligo dT primer 
that usually comprises an arbitrary anchor sequence at its 5' end and (b) a CAPswitch™ 

30 oligonucleotide. During first strand synthesis the oligo(dT) anneals to the poly A tail of the 
mRNA in the sample and synthesis extends beyond the 3* end of the RNA to include the 
CAPswitch™ oligonucleotide, yielding a first strand cDNA comprising an arbitrary 
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sequence at its 5' end and a region complementary to the CAPswitch™ oligonucleotide at its 
3' end. The length of the dT primer will typically range from 15 to 30 nts, while the arbitrary 
anchor sequence or portion of the primer will typically range from 15 to 25 nt in length. 

Following first strand synthesis, the cDNA is amplified by combining the first strand 
5 cDNA with primers that correspond at least partially to the anchor sequence and the 

CAPswitch™ oligonucleotide under conditions sufficient to produce an amplified amount of 
the cDNA. Labeled nucleic acid is then produced by contacting the resultant amplified 
cDNA with a set of gene specific primers, a polymerase and dNTPs, where at least one of the 
gene specific primers and dNTPs are labeled. 

10 When employed to generate target, as described above, the gene specific primers of 

the sets of primers according to the subject invention are typically chosen according to a 
number of different criteria. In some embodiments of the invention, primers of interest for 
inclusion in the set include primers corresponding to genes which are typically differentially 
expressed in different cell types, in disease states, in response to the influence of external 

1 5 agents, factors or infectious agents, and the like. In other embodiments, primers of interest 
are primers corresponding to genes which are expected to be, or already identified as being, 
differentially expressed in different ceil, tissue or organism types. Preferably, at least 2 
different gene functional classes will be representedin the sets of gene specific primers, 
where the number of different functional classes of genes represented in the primer sets will 

20 generally be at least 3, and will usually be at least 5. Gene functional classes of interest 

include oncogenes; genes encoding tumor suppressors; genes encoding cell cycle regulators; 
stress response genes; genes encoding ion channel proteins; genes encoding transport 
proteins; genes encoding intracellular signal transduction modulator and effector factors; 
apoptosis related genes; DNA synthesis/recombination/repair genes; genes encoding 

25 transcription factors; genes encoding DNA-binding proteins; genes encoding receptors, 
including receptors for growth factors, ehemokines, interieukins, interferons, hormones, 
neurotransmitters, cell surface antigens, cell adhesion molecules etc.; genes encoding cell- 
cell communication proteins, such as growth factors, cytokines, chemokines, interieukins, 
interferons, hormones etc.; and the like. Less preferred are gene specific primers that are 

30 subject to formation of strong secondary structures with less than -lOkcal/mol; comprise 
stretches of homopolymeric regions, usually more than 5 identical nucleotides; comprise 
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more than 3 repetitive sequences; have high, e.g. more than 80%, or low, e.g. less than 30%, 
GC content etc. 

The particular genes represented in the set of gene specific primers will necessarily 
depend on the nature of physiological source from which the RNAs to be analyzed are 

5 derived. For analysis of RNA profiles of eukaryotic physiological sources, the genes to 
which the gene specific primers correspond will usually be Class II genes which are 
transcribed into RNAs having 5' caps, e.g. 7-methyl guanosine or 2,2,7-trimethylguanosine. 
where Class II genes of particular interest are those transcribed into cytoplasmic mRNA 
comprising a 7-methyl guanosine 5' cap and a polyA tail. 

10 For analysis of RNA profiles of mamma'ian physiological sources, of particular 

interest are gene specific primers corresponding to the functional gene classes listed above. 
For analysis of RNA profiles of human physiological sources, the gene specific primers of 
particular interest are the gene specific primers identified in Table 1 as SEQ ID NO:01 to 
SEQ ID NO:1372, of U.S. Application Serial No. 08/859,998, the disclosure of which is 

1 5 herein incorporated by reference, where sets of these primers will usually include at least 20 
and more usually at least 50 of these specific sequences. 

Particular sets of primers of interest in the subject invention are those sets of primers 
that include primers capable of amplifying at least a portion of the unique polynucleotides 
present on the arrays with which the target is to be employed. By at least a portion is meant 

20 at least about 1 0, usually at least about 20 and more usually at least about 25 number % 
(where number is the number of different unique polynucleotides on the array). For 
examples, sets of primers that include primers capable of amplifying at least a portion of the 
unique polynucleotides listed in Table I, supra, are of interest. Similarly sets of primers 
capable of amplifying at least a portion of the unique polynucleotides listed in Tables 2 to 8, 

25 supra, are also of interest. 

In a particularly preferred embodiment, the gene specific primers are preferably those 
primers that correspond to the different polynucleotide spots on the array that is used in the 
hybridization assay. Thus, one will preferably employ gene specific primers for each 
different polynucleotide that is present on the array, so that if the gene is expressed in the 

30 particular cell or tissue being analyzed, labeled target will be generated from the sample for 
that gene. In many embodiments in which the subject arrays are employed, the gene specific 
primers used to generate the target from the human cell or tissue being analyzed will have 
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the same sequence as the gene specific primers used to generate the polynucleotide probes 
present on the array. In this manner, if a particular gene present on the array is expressed in a 
particular sample, the appropriate target will be generated and subsequently identified. 
Representative sets of primers falling within this particularly preferred embodiment include: 
SET DESCRIPTION 



i 



t pair of primers capable of amplifying each 
polynucleotide listed in Table I, supra, as well 
one set of primers capable of amplifying each of 
the complementary sequences of each of the 
polynucleotides listed in Table I. 

1 pair of primers capable of amplifying each 
polynucleotide listed in Table 2, supra, as well 
one set of primers capable of amplifying each of 
the complementary sequences of each of the 
polynucleotides listed in Table 2. 

1 pair of primers capable of amplifying each 
polynucleotide listed in Table 3 t supra, as well 
one set of primers capable of amplifying each of 
the complementary sequences of each of the 
polynucleotides listed in Table 3. 



1 0 Hybridization and Detection 

As mentioned above, following preparation of the target nucleic acid from the tissue 
or cell of interest, the target nucleic acid is then contacted with the array under hybridization 
conditions, where such conditions can be adjusted, as desired, to provide for an optimum 
level of specificity in view of the particular assay being performed. Suitable hybridization 

1 5 conditions are well known to those of skill in the art and reviewed in Maniatis et al, supra 
and WO 95/21944. In analyzing the differences in the population of labeled target nucleic 
acids generated from two or more physiological sources using the arrays described above, 
each population of labeled target nucleic acids are separately contacted to identical probe 
arrays or together to the same array under conditions of hybridization, preferably under 

20 stringent hybridization conditions (for example, at 50°C or higher and 0.1XSSC (15 mM 
sodium chloride/01.5 mM sodium citrate)) , such that labeled target nucleic acids hybridize 
to complementary probes on the substrate surface. 

Where all of the target sequences comprise the same label, different arrays will be 
employed for each physiological source (where different could include using the same array 

25 at different times). Alternatively, where the labels of the targets are different and 



-148- 



WO 98/53103 



PCT/US98/10561 



distinguishable for each of the different physiological sources being assayed, the opportunity 
arises to use the same array at the same time for each of the different target populations. 
Examples of distinguishable labels are well known in the art and include: two or more 
different emission wavelength fluorescent dyes, like Cy3 and Cy5, two or more isotopes 
5 with different energy of emission, like 32 P and 33 P, labels which generate signals under 

different treatment conditions, like temperature, pH, treatment by additional chemical agents, 
etc., or generate signals at different time points after treatment. Using one or more enzymes 
for signal generation allows for the use of an even greater variety of distinguishable labels, 
based on different substrate specificity of enzymes (alkaline phosphatase/peroxidase), 
10 Following hybridization, non-hybridized labeled nucleic acid is removed from the 

support surface, conveniently by washing, generating a pattern of hybridized nucleic acid on 
the substrate surface. A variety of wash solutions are known to those of skill in the art and 
may be used. 

The resultant hybridization patterns of labeled nucleic acids may be visualized or 
15 detected in a variety of ways, with the particular manner of detection being chosen based on 
the particular label of the target nucleic acid, where representative detection means include 
scintillation counting, autoradiography, fluorescence measurement, colorimetric 
measurement, light emission measurement and the like. 

Following detection or visualization, the hybridization patterns may be compared to 
20 identify differences between the patterns. Where arrays in which each of the different probes 
corresponds to a known gene are employed, any discrepancies can be related to a differential 
expression of a particular gene in the physiological sources being compared. 

Utility 

25 The subject methods find use in, among other applications, differential gene 

expression assays. Thus, one may use the subject methods in the differential expression 
analysis of: (a) diseased and normal tissue, e.g. neoplastic and normal tissue, (b) different 
tissue or tissue types; (c) developmental stage; (d) response to external or internal stimulus; 
(e) response to treatment; and the like. The subject arrays therefore find use in broad scale 

30 expression screening for drug discovery and research, such as the effect of a particular active 
agent on the expression pattern of genes in a particular cell, where such information can be 
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used to reveal drug toxicity, carcinogenicity, etc., environmental monitoring, disease 
research and the like. 

Kits 

5 Also provided are kits for performing analyte binding assays using the subject 

devices, where kits for carrying out differential gene expression analysis assays are 
preferred. Such kits according to the subject invention will at least comprise the subject 
arrays. The kits may further comprise one or more additional reagents employed in the 
various methods, such as primers for generating target nucleic acids, including a set of gene 

10 specific primers according to the subject invention, e.g. primer sets 1 to 9 described above, 
dNTPs and/or rNTPs, which may be either premixed or separate, one or more uniquely 
labeled dNTPs and/or rNTPs, such as biotinyiated or Cy3 or Cy5 tagged dNTPs, or other 
post synthesis labeling reagent, such as chemically active derivatives of fluorescent dyes, 
enzymes, such as reverse transcriptase \ DNA polymerases, and the like, various buffer 

15 mediums, e.g. hybridization and washing buffers, prefabricated probe arrays, labeled probe 
purification reagents and components, like spin columns, etc., signal generation and 
detection reagents, e.g. streptavidin-alkaline phosphatase conjugate, chemifluorescent or 
chemiluminescent substrate, and the like. 

20 

The following examples are offered by way of illustration and not by way of 
limitation. 

EXPERIMENTAL 

25 

F.xa pipte 1 - Gener ation of human cDN A array 

686 cDNA fragments corresponding 686 different human genes were amplified from 
quick-clone cDNA (CLONTECH) in 686 separate test tubes using a combination of sense 
and antisense gene-specific primers: (Set No, 9, described supra). Amplification was 
30 conducted in a 100-ul volume containing 2 ui of mixture of 10 Quick-clone cDNA from 
placenta, brain, liver, lung, leukocytes, spleen, skeletal muscle, testis, kidney and ovary 
(CLONTECH), 40 mM Tricine-KOH (pH 9.2 at 22°C), 3.5 mM Mg(OAc),, 10 mM KOAc, 
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75 |ig/mi BSA, 200 of each dATP, dGTP, dCTP and dTTP, 0.2 \iM of each sense and 
antisense gene-specific primers and 2 ul of KlenTaq Polymerase mix. Temperature 
parameters of the PCR reactions were as follows: I min at 95°C followed by 20-35 cycles of 
95 °C for 15 sec and 68°C for 2 min; followed by a 10-min final extension at 68 °C. PCR 
5 products were examined on 1 .2% agarose/EtBr gels in I x TBE buffer. As a DNA size 

marker a 1 Kb DNA Ladder was used, ds cDNA was then precipitated by addition of a half 
volume of 4M ammonium acetate (about 35 \xl) and 3.7 volumes of 95% ethanol (about 260 
Hi). After vortexing, the tube was immediately centrifuged at 14,000 r.p.m. in a 
microcentrifuge for 20 min. The pellet was washed with 80% ethanol without vortexing, 

10 centrifuged as above for 10 min, air dried, and dissolved in 1 0 jllI of deionized water. Yield 
of ds cDNA after the amplification step was about 5 ^g. The ds cDNA fragments for ail 686. 
genes were cloned into TA-cloning vector using the manufacturer's recommendations 
(Invitrogen) and identity of the clones was confirmed by sequence analysis. The ds cDNA 
inserts with the sequence corresponding 686 genes were amplified by PCR using a 

1 5 combination of antisense and sense gene-specific primers, as described above. The ds cDNA 
was denatured by adding 1 \il of 10X denaturing solution (1 M NaOH, 10 mM EDTA) and 
incubating at 65°C for 20 min. All cDNA probes were transferred in 384-weli plate and 
loaded on positively charged nylon membrane (Schieher & Schull) using 384 pin tool and 
Biomek 2000 (Beckman) robot. The resultant array is described in Table 1. 

20 

Example 2 - Generation of 33 P-labeled oligonucleotides during fi rst strand cDNA 
synthesis 

Step A. cDNA synthesis/Labeling Procedure 
25 I jig of polyA+RNA or total RNA was converted into 33 P-labeled first-strand cDNA 

as follows. A sufficient volume of master mix for all labeling reactions and 1 extra reaction 
was prepared as follows to ensure sufficient volume. For each 10-jil labeling reaction, the 
following reagents were mixed: 

2 ul 5X First-strand buffer (250 uM Tris-HCl pH8.3; 375 mM KCl; 15 mM MgCl 3 ) 
30 I \x\ lOXdNTP mix (500 uM dGTP, 500 uM dCTP, 500 \iM dTTP, 5 uM dATP) 

4 jil [a- "PJdATP (Amersharn, 3000 Ci/mmol. 1 0 mCi/ml) 
\ til MMLV reverse transcriptase (Amersham, 200 units/jil) 
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8 fit Final volume 
Next, the following reagents were combined in a 0.5-mI PCR test tube: 
I ^ g (1-2 m0 polyA+RNA sample 
5 | M | 10x gene-specific primers mix ( 0.2 \xM of each oligonucleotide ID No, 

2,4,6,8,10,12 1372 from Table I of U.S. Patent Application Serial No. 

08/859,998* the discosure of which is herein incorporated by reference.) 



10 As a control, in separate test tube were mixed 1 ug of polyA+RNA sample with 1 ul of oligo 
dT primer (CDS1, 5 r -d(TCTAGAATTCAGCGGCCGC(T) 30 VN) - 3' 

(where V=G or A or C; TsNG or A or T or C) 

For each tube, ddH 2 0 was added to a final volume of 3 ul and the contents were 
1 5 mixed and spun briefly in a microcentrifuge. The tubes were then incubated in a preheated 
PCR thermocycler at 70°C for 2 min. The temperature in thermocycle was reduced down to 
50°C and the tube contents were incubated for 2 min. 8 ul of master mix as prepared above 
were added to each reaction test tube. The contents of the test tubes were then mixed by 
gentle pipetting. The tubes were then incubated in a PCR thermocycler for 20 min at 50°C. 
20 The reaction was then stopped by adding 1 uJ of 1 OX termination mix (0. 1 M EDTA, 1 
mg/ml glycogen). 

Step B. Column Chromatography 

The 32 P-labeled cDNAs were separated from unincorporated 32 P-Iabeled nucleotides 

25 and small (<0.1- kb) cDNA fragments using the following procedure for each test tube. A 
CHROMA SPIN-200 column (CLONTECH, Palo Alto, CA) was placed into a 1.5-ml 
microcentrifuge tube, the water was allowed to drain through the column by gravity flow 
until the surface of the gel beads emerged in the column matrix. The sample was then 
applied to the center of the gei bed's flat surface and allowed to be fully absorbed into the 

30 resin bed. 25 ul of ddH.O were then applied and allowed to completely drain out of the 
column. 200 ul of ddH.O were then applied and allowed to completely drain out of the 
column until there was no liquid left above the resin bed. The column was then transferred to 
a clean l.5-ml microcentrifuge tube. 
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To collect the first fraction, 100 jjl! of ddH 3 0 were added to the column and allowed 
to completely drain out of the column. The second, third and fourth fractions were collected 
in analogous fashion. The tubes with fractions 1-4 were then placed in scintillation counter 
empty vials, and Cherenkov counts for each fraction were obtained in the tritium channel. 
5 The fractions which showed the highest Cerenkov counts were pooled. 

Example 3 - feneration o f f!v3-labeled hybridization polynw»TOtide target from 
polvA+RNA using postsv n thesis labelling procedure 

10 In this procedure for generating labeled cDNA target, polyA+RNA is first converted 

into cDNA that has primary amino groups which are subsequently coupled with Cy3 
succinimide ester. This technology allows for a significant increase (about 10 fold) in 
activity of labeled polynucleotide target and therefore increases the overall sensitivity of 
detection * ck gene expression. The same procedure can be used for labeling two (or more) 

1 5 samples of RNA. In this case the cDNA synthesis step was the same for both samples but at 
the labeling step, each cDNA sample was labeled by different and distinguishable labels, e.g. 
Cy3 and Cy5, Alexa 532 and Bodipy TR, Fluorescein and tetramethyl rhodamine, etc. Each 
labeled probe was purified separately by column chromatography and, after normalization, 
were combined together in equal ratio and hybridized with a cDNA array. After 

20 hybridization, the detection procedure revealed both dye-labeled hybridized target 

simultaneously, based on the different spectral characteristics (emission wavelength) of the 
fluorescent labels. 

A. cDNA synthesis 
25 The 1(M reaction described below converted 1 fxg of polyA+RNA into amino- 

modified first-strand cDNA. 

For each cDNA synthesis reaction: 

1 . Enough master mix for all labeling reactions and 1 extra reaction was prepared to 
ensure sufficient volume. 
30 For each 10-^1 labeling reaction, the following reagents were mixed: 

2 m! 5X First-strand buffer (250 mMTHs-HC! pH8.3: 375 mM KCI; 15mMMgC12) 
1 u\ lOXdNTP mix (500 /.M dGTP, 500 juM dCTP, 500 /iM dATP, 100 M M dTTP, 
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and 100 vM allylamino dUTP ) 
1 M l [oc- n P]dATP (Amersham, 3000 Ci/mmol, 10 mCi/ml) 
3 H 2 0 

I fx\ MMLV reverse transcriptase (Amersham, 200 units/ul) 



8 fx\ Final volume 



2. The following was combined in a 0.5-ml PCR test tube: 
I Mg ( t -2 m0 polyA+RN A sample 

10 I /il I Ox gene-specific primers mix ( 0.2 uM of each oligonucleotide ID No. 

2,4,6,8,10,12 1372) (from Table ! of U.S. Patent Application No. 

08/859,998, the disclosure of which is herein incorporated by reference.) 

As a control in separate test tube 1 of polyA+RNA sample was mixed with I & 
1 5 of oligo dT primer (SEQ ID NO. 1373 from Table I of U.S. Application No. 08/859,998). 

3 . ddH 2 0 was added to a final volume of 3 /A 

4. The contents were mixed and the tubes were spun briefly in a microcentrifuge. 

5. The tubes were incubated in preheated PCR thermocycler at 70°C for 2 min. 

20 6. The temperature in the thermocycle was reduced down to 50 °C and incubate for 2 
min. 

7. 8 tA of master mix were added to each reaction test tube. 

8. The contents of the test tubes were mixed by gentle pipeting. 

9. The tubes were incubated in a PCR thermocycler for 30 min at 50°C. 

25 10. The reaction was stopped by increasing temperature up to 70°C for 5 min, then 
cooled to 37°C. 

1 L M of RNase H (10 units/^O was added and the tubes were incubated at 37°C for 15 



mm. 



12. The reaction was stopped by adding 40 »l of termination mix (0.3 M sodium acetate. 

30 pH 5.0, 1 mMEDTA). 

13. An equal volume (50 /A) of phenol/chlorophorm/isoamyl alcohol mix (I: 1: 1/24 v/v) 
was added and extraction was performed. Phases were separated by centrifugation at 
14.000 rpm for 10 min. 
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10 



14. Upper water phase was collected and cDNA was precipitated by adding 2.5 volumes 
(about 120 m0 of ethanol. 

15. The precipitate was collected by centrifugation at 1 4,000 rpm for 1 0 min, the 
supernatant removed and the precipitate was washed with 80% ethanol. 

16. The precipitate was air dried and dissolved in 10 iA of 0. 1 M sodium bicarbonate 
buffer, pH 9.0. 

Step B. Post synthesis labeling procedure. 

1. 1 mg of Cy3 succinimide ester was dissolved in 10 a*1 of dimethyl sulfoxide and 10 
Ail of amino-modified cDNA generated at step 16 was added to it. 

2. The mixture was incubated at room temperature overnight. 



StepC. Column Chromatography 

To purify the Cy3-labeled cDNAs from the unconjugated label, the following was 

1 5 performed for each test tube: 

1 . CHROMA SPIN-200 column (CLONTECH) was removed from refrigerator and 
allowed to warm at room temperature for about 1 hour. The column was inverted 
several times to completely resuspend the gelmatrix. (Note: Check for air bubbles in 
the column matrix. If bubbles are visible, resuspend the matrix in the in the column 

20 buffer (ddH 2 0) by inverting the column again). 

2. The bottom cap from the column was removed, and then the top cap was slowly 
removed. 

3. The column was placed into a 1 .5-ml microcentrifuge tube. 

4. The water was allowed to drain through the column by gravity flow until the surfaces 
25 of the gel beads in the column matrix were visible. (The top of the column matrix 

should be at the 0.75-ml mark on the wall of the column. If the column contains 
much less matrix, adjust the volume of the matrix to 0.75ml mark using matrix from 
another column.) 

5. The collected water was discarded. 

30 6. The sample was applied to the center of the gel bed's flat surface and allowed to be 
fully absorbed into the resin bed. Care was taken not allow any sample to flow along 
the inner wall of the column. 
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7. 25 jul of ddH.O were applied and allowed to completely drain out of the column. 

8. Apply 200 jul of ddH 3 0 and allow the buffer to completely drain out of the column 
until there was no liquid left above the resin bed. 

9. The column was transfered to a clean 1 .5-ml microcentrifuge tube. 

5 10. 100 /^l of ddH 2 0 were added to the column and allowed to completely drain out of the 
column. 

1 1 . The second, third and fourth fractions were collected by repeating steps 9-10. 

12. Cherenkov counts were obtained for each fraction by counting the entire sample in 
the tritium channel. 

10 13. The fractions (usually fractions 2-3) which showed highest Cerenkov counts were 

pooled. Waste column and the fractions (usually fraction 1 and 4) which showed less 
than 1 0% counts from peak fractions. 



Pimp le 4 ~ Hybridiza tion 32 P-labeled cDNA Target with cDNA Army 

15 

A solution of ExpressHyb™ (CLONTECH) and sheared salmon testes DNA (Sigma) 
was prepared by prewarming 15 ml of ExpressHyb™ at 50-60°C, heating 1.5 mg of sheared 
salmon testes DNA at 95-100°C for 5 min followedby chilling quickly on ice, and 
combining the resultant heat-denatured sheared salmon testes DNA with the prewarmed 

20 ExpressHyb™. 

A cDNA Array as produced in Example 1 above was then placed in a hybridization 
bottle and 10 ml of the solution prepared above was added to the bottle. Prehybridization 
was performed for 30 min with continuous agitation at 72 °C. Labeled cDNA probe 
(Example 1, about 200 ul, total about 2-5xl0 6 cpm) with l/10th of the total volume ( about 

25 22 ul) of I Ox denaturing solution (1 M NaOH, 10 mM EDTA) was mixed and incubated at 
65°C for 20 min. 5 ul (1 fig/ul) of human Cot- 1 DNA was then added, and an equal volume 
(about 225 u.1) of 2x Neutralizing solution ( 1 M NaHP04, pH 7.0) was added and incubation 
continued at 65 °C for 10 min. The mixtures were then combined and thoroughly mixed. 

The prehybridization solution was replaced with the solution comprising the labeled 

30 oligonucleotide as prepared above and allowed to hybridize overnight with continuous 

agitation at 65 °C. Following hybridization, the hybridization solution was carefully removed 
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and discarded, replaced with 200 ml of Wash Solution 1 (2X SSC. 1% SDS). The array was 
washed for 20 min with continuous agitation at 65 °C. Washing was repeated four times. 

Two additional 20-min washes were then performed in 200 mi of prewarmed Wash 
Solution 2 (0.1X SSC, 0.5% SDS) with continuous agitation at 65 °C. Using forceps, the 
5 cDNA array was removed from the container and excess wash solution was removed by 
shaking. 

The damp membrane was immediately wrapped in plastic wrap, mounted on 
Whatman paper (3mm Chr) and exposed to x-ray film at -70°C with an intensifying screen. 

10 Example 5 -Comparison B e tween Using Sets of Gene Specific Primers and oligo dT 

32 P-labeled cDNA target were synthesized by M-MLV reverse transcriptase from a 
mixture 588 antisense gene-specific primers (B) or oligo dT(A) using placenta polyA+RNA 
as a template as described in Example 2. Primer extension products generated by reverse 

1 5 transcription were purified by gel filtration as described in Example 2 and hybridized 

separately with two cDNA arrays comprising 588 human genes under identical conditions as 
described in Example 4. Signals which can be detected by using cDNA target generated 
using the set of gene specific primers but can not be detected by using conventional target 
generated with oligo dT primers were observed. Note, the level of non-specific background 

20 detected as signal generated by membrane alone outside of the regions with immobilized 
probes generated by target generated using oligo dT primers was significantly higher in 
comparison with the background generated by the target generated by using the sets of gene 
specific primers. 

25 Example 6 - Generation of cDNA a rray pr o be immobi lized on Rlass slides, 

50 cDNA fragments corresponding to 50 different human genes were amplified from 
plasmid clones containing corresponding cDNA fragments in 96 well plates using 
combination of vector primer ID No. 1376 and ID No. 1377 or sense and antisense gene- 
specific primers: ID No. 1+2, 3+4,5+6,7+8,.... 100+101 (from Table 1 of U.S. Patent 

30 Application No. 08/859,998, the disclosure of which is herein incorporated by reference). 
Amplification was conducted in a 400-jul volume containing 2 ng of plasmid DNA, 40 miM 
Tricine-KOH (pH 9.2 at 22°C), 3.5 mM Mg(OAc),, 10 MM KOAc, 75 ^g/ml BSA, 200 uM 
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of each dATP, dGTP, dCTP and dTTP, 0.2 of each primers and 2 iA of KlenTaq 
Polymerase mix (CLONTECH). Temperature parameters of the PGR reactions were as 
follows: 1 min at 95°C followed by 30 cycles of 95°C for 15 sec and 68°C for 2 min; 
followed by a 10-min final extension at 68°C. PCR products were examined on 1.2% 
5 agarose/EtBr gels in I x TBE buffer. As a DNA size marker , a 1 Kb DNA Ladder was used, 
ds cDNA was then precipitated by addition of a 10% volume of 3M sodium acetate (pH 5-0) 
(about 40 jul) and 2.5 volumes of 96% ethanol (about 1 ml). After vortexing, the tube was 
immediately centrifuged at 14,000 r.p.m. in a microcentrifuge for 20 min. The pellet was 
washed with 80% ethanol without vortexing, centrifuged as above for 10 min, air dried, and 

10 dissolved in 10 pel of deionized water. Yield of ds cDNA after amplification step was about 
20 Mg. The ds cDNA was solved in 10 iA of distilled water, 10 jA of 1 M sodium carbonate 
buffer, pH 9.5, was added and the ds cDNA was denaturated by heating at 94°C for 5 min 
and cooled down. The treated glass slides were prepared as following: Glass slides were 
cleaned overnight in 25% solution of nitric acid at room tempera) 'ire, washed 3 times by 

15 acetone, treated with 1% aminopropyl-trimethoxysilane for 3 hrs at room temperature, 
washed two times with acetone, heated at 120° C for 6 hrs and then treated with 0.2 % of 
para-phenylendiisothiocyanate (95:5 acetone-water solution) at room temperature for 3 hrs, 
then washed two times by acetone and dried in vacuum with desiccant. All cDNA probes 
were transferred in 384-well plate and printed on treated glass slides using 384 pin tool and 

20 Biomek 2000 (Beckman) robot. After printing, the arrays were incubated in wet chamber at 
37°C overnight, then ultraviolet-cross linked to the surface by subjecting the slides to 30 mJ 
of energy (Stratagene Stratalinker). The arrays were washed with 1% of sodium borohydrate 
in 0.1 M NaOH, then washed 3 times in distilled water, dried in vacuum and stored with 
desiccant. 

25 

Example 7- Hybridization Cv3 -labeled cDNA Target (or Cv3/CvS labeled cQNA 
tar gets') with glass cDNA array 

1. A solution of ExpressHyb (CLONTECH) and sheared salmon testes DNA (Sigma) 
30 was prepared as follows: 

a. 5 ml of ExpressHyb™ was prewarmed at 50-60°C. 
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b. 0.5 mg of the sheared salmon testes DNA was heated at 95-100 °C for 5 
min, and then chilled quickly on ice. 

c. Heat-denatured sheared salmon testes DNA was mixed with prewarmed 
ExpressHyb. 

5 2. The glass cDNA array was placed in a hybridization container, and 1 ml of the 
solution prepared in step 1 above was added. 

3. Prehybridization was conducted for 5 min with continuous agitation at 65 °C. 

4. Labeled cDNA probe as prepared in example 3, step C 1 3, above, (about 200 ^1) was 
mixed with 2 iA ( 1 ^g/Ail) of human Cot- I DNA , and denaturated at 99 °C for 2 

10 min. 

5. The mixture prepared in Step 4 was added to the hybridization box from Step 3 and 
the two solutions were mixed together thoroughly. The container was sealed by 
sealing tape. 

6. Hybridization was allowed to proceed overnight with continuous agitation at 65 °C 
1 5 7. The hybridization solution was carefully removed and discarded in an appropriate 

container, and replaced with 10 ml of Wash Solution 1 (2X SSC, 0.1% SDS). The 
array was washed for 10 min with continuous agitation at 65°C. The step was 
repeated two times. 

8. Additional 10-min washes were performed in 10 ml of Wash Solution 2 (0. 1 X SSC, 
20 0. 1 % SDS) with continuous agitation at 65 °C. 

9. Using forceps, the cDNA array was removed from the container, briefly washed in 0. 
1XSSC and excess buffer was removed from surface by centrifugation in a Beckman 
CS-6R centrifuge at 2000 rpm. 

10. Glass arrays were scanned using a custom-built laser scanner equipped by green (Cy3 
25 Chanel) and red ( Cy5 chanel) solid state laser built in UCLA. Images were scanned 

at a resolution of 20 jj,m per pixel. 



It is evident from the above results and discussion that the subject invention provides 
a rapid, high throughput means to simply and quickly obtain a broad-scale screening of gene 
30 expression in a variety of different samples. Only simple hybridization protocols need be 
employed with the subject arrays, and signals can be detected using any convenient and 
readily available detection device. Despite their simplicity, assays conducted with the 
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subject arrays yield a large amount of information regarding the expression of numerous 
different and important genes in a particular sample at substantially the same time, and thus 
have use in many different types of applications, including drug discovery and 
characterization, disease research, and the like. 

5 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
1 0 admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it is readily apparent to 
1 5 those of ordinary skill in the art in light of the teachings of this invention that certain changes 
and modifications may be made thereto without departing from the spirit or scope of the 
appended claims. 
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WHAT fS CLAIMED TS: 

L An array comprising a plurality of polynucleotide spots stably associated with the 
surface of a solid support, wherein a portion of said plurality of polynucleotide spots 
comprise a polynucleotide probe composition made up of unique polynucleotides and all of 
5 the unique polynucleotides on said array correspond to genes of a specific type. 

2. The array according to Claim I , wherein said polynucleotides of said array have an 
average length of from about 120 to 1000 nt. 

10 3. The array according to Claims 1 or 2, wherein each of said unique polynucleotides 
does not cross hybridize with the polynucleotides of any other polynucleotide probe 
composition on the array. 

4. The array according to Claims 1 ¥ 2 or 3, wherein said polynucleotide probe 
1 5 composition comprises a population of single stranded identical polynucleotides. 

5. The array according to Claims 1, 2 or 3, wherein said polynucleotide probe 
composition comprises a population of two different complementary single stranded 
polynucleotides. 

20 

6. The array according to any of the preceding claims* wherein the density of spots on 
said array does not exceed about 500/cm 2 . 

7. The array according to any of the preceding claims, wherein the number of spots on 
25 said array ranges from about 50 to 1000. 

8. The array according to any of the preceding claims, wherein said array is selected 
from the group consisting of a human array, a mouse array, a cancer array, an apoptosis 
array, a human stress array, an oncogene/tumor suppressor array, a cell-cell interaction array, 

30 a cytokine and cytokine receptor array, a rat array, a blood array, a mouse stress array, and a 
neuroarray. 
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9. The array according to any of the preceding claims, wherein said solid support is 
flexible. 

10. The array according to any of the preceding claims, wherein said solid support is 
5 rigid. 

11. The array according to any of the preceding claims, wherein said polynucleotide 
probes of said array are those listed in a table selected from the group consisting of: Table 1 , 
Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8. 

10 

12. A method of preparing an array according to any of the preceding claims, said 
method comprising: 

enzymatically generating said unique polynucleotides; and 
stably associating said enzymatically-generated, complementary, unique 
1 5 polynucleotides on the surface of said solid support. 

13. A set of a representative number of distinct gene specific primers comprising gene 
specific primers corresponding to at least twenty distinct genes. 



20 



25 



14. The set of gene specific primers according to Claim 13, wherein at least two of the 
twenty distinct genes are from different gene functional classes. 

15. The set of gene specific primers according to Claim 1 4, wherein the set comprises 
from 20 to 10,000 gene specific primers. 

16. The set of gene specific primers according to Claims 13, 14 or 15, wherein the set 
comprises primers capable of amplifying at least a portion of the polynucleotides present on 
an array according to any of Claims 1 to 1 1. 

30 1 7. The set of gene specific primers according to Claim 1 6, wherein the set comprises 
primers capable of amplifying at least 20 of the polynucleotides present on an array 
according to any of Claims 1 to 1 1 . 
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1 8. A method for detecting expression of a gene using a hybridization assay, said method 
comprising: 

contacting at least one. labeled target polynucleotide sample with an array according 
to any of Claims 1 to 1 1 under hybridization conditions sufficient to produce a hybridization 
5 pattern; and 

detecting said hybridization pattern. 

1 9. The method according to Claim 1 8, wherein said method further comprises washing 
said array prior to said detecting step. 

10 

20. The method according to Claims 1 8 or 19, vvherein said method further comprises 
preparing said labeled target polynucleotide sample. 

21. The method according to Claim 20, wherein said preparation comprises: 
1 5 obtaining a sample of nucleic acids from a physiological source; and 

generating a population of labeled nucleic acids from the nucleic acids sample by 
using a set of a representative number of distinct gene specific primers according to any of 
Claims 13 to 17; 

whereby said labeled target polynucleotide sample is produced. 

20 

22. The method according to Claims 20 or 21, wherein said preparing comprises 
conjugating a detectable label to a functionalized target polynucleotide. 

23. The method according to any of Claims 1 8 to 22, where said method further 
25 comprises: 

generating a second hybridization pattern; and 
comparing said hybridization patterns. 

24. The method according to Claim 23, wherein said hybridization patterns are generated 
30 on the same array. 
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25. The method according to Claim 23, wherein the second hybridization patters are 
generated on different arrays. 

26. A kit for use in a hybridization assay, said kit comprising: 
5 an array according to any of Claims 1 to 1 i . 

27. The kit according to Claim 26, wherein said kit further comprises reagents for 
generating a labeled target polynucleotide sample. 

10 28. The kit according to Claims 27, wherein said reagents comprise a set of a 

representational number of gene specific primers according to any of Claims 13 to 17. 

29. A kit for use in detecting the differential expression of genes of a plurality of 
physiological sources, the kit comprising: 
15 a set of a representative number of distinct gene specific primers according to any of 

Claims 13 to 17. 
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